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Abstract 

We consider several variants of the following problem: pick annxn matrix 
from some unitary ensemble of random matrices. In an interval containing 
many eigenvalues, what is the closest spacing between two eigenvalues? We 
are interested in the correct scaling for this random variable as n — > oo, and 
the limiting distribution of the rescaled random variable. 

One can predict the distribution of the minimum spacing heuristically by 
assuming that the consecutive spacings are independent random variables, 
chosen from the consecutive spacing distribution for unitary ensembles. The 
consecutive spacings are not independent. However, in all cases studied, the 
heuristics predict asymptotically the correct scaling and distribution of the 
minimum spacing. 

Using the method of moments, we show that the number of eigenvalue pairs 
in a given interval and closer than a small distance 7, is approximately a 
Poisson random variable with mean as predicted by heuristics. Varying 7, 
we obtain the result for minimum spacing. 

In the most concrete special case, our Main Theorem is this: 

Theorem 0.1 (Main Theorem). Choose a random n x n unitary matrix. 
Let Z n be the closest spacing between any two eigenvalues. Fix /3 > 0. Then 
as n — > 00, 

The correct scaling for the closest eigenvalue spacing is n _1 (n|/ n |) _ 3 . In 
this case, n~ l is comparable to the mean spacing and n\I n \ is comparable to 
the expected number of eigenvalues in the interval being considered. On the 
other hand, if the eigenvalues were independent and distributed according 
to a Poisson process, then the closest spacing would scale like n~ l {n\I n \)~ l , 
which is much closer. Thus we have another confirmation of what is often 
said: "The eigenvalues of a unitary matrix repel each other" . 

Our results also apply to the Gaussian unitary ensemble (GUE) and, with 
restrictions, to a universal class of unitary ensembles (UUE) studied by Deift, 
Kreicherbauer, McLaughlin, Venakides, and Zhou. In each of these cases, 
the expected number of eigenvalues in the interval must be large, and the 
interval must be contained inside the bulk distribution of the eigenvalues. 
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1. Introduction 



We consider several variants of the following problem: pick annxn matrix 
from some unitary ensemble of random matrices. In an interval containing 
many eigenvalues, what is the closest spacing between two eigenvalues? We 
are interested in the correct scaling for this random variable as n — > oo, and 
the limiting distribution of the rescaled random variable. 

One can predict the distribution of the minimum spacing heuristically by 
assuming that the consecutive spacings are independent random variables, 
chosen from the consecutive spacing distribution for unitary ensembles. The 
consecutive spacings are not independent. However, in all cases studied, the 
heuristics predict asymptotically the correct scaling and distribution of the 
minimum spacing. 

Using the method of moments, we show that the number of eigenvalue pairs 
in a given interval and closer than a small distance 7, is approximately a 
Poisson random variable with mean as predicted by heuristics. Varying 7, 
we obtain the result for minimum spacing. 

Our results apply to the circular unitary ensemble (CUE), the Gaussian 
unitary ensemble (GUE), and with restrictions to a universal class of unitary 
ensembles (UUE) studied by Deift, Kreicherbauer, McLaughlin, Venakides, 
and Zhou. In each of these cases, the expected number of eigenvalues in the 
interval must be large, and the interval must be contained inside the bulk 
distribution of the eigenvalues. 

1.1. Statement of the Main Theorem. The Main Theorem is easi- 
est to state in the case of the circular unitary ensemble (CUE), which is the 
compact group U n with Haar measure. 

Theorem 1.1. For each n, let I n C [0, 2ir) be an interval so that n\I n \ — > 00 
as n — ^ 00. 

Choose a matrix randomly from CUE n and let Z n be the closest spacing 
between any two eigenvalues whose average is in I n . Fix f3 > 0. Then as 
n — > oo, 



Of particular interest is the case when /„ = [0, 2tt) for all n: 

Theorem 1.2 (Main Theorem, CUE Version). Choose a matrix randomly 
from CUE n and let Z n be the closest spacing between any two eigenvalues. 
Fix /3 > 0. Then as n — > 00, 
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Given Theorem 1 1 . 2 1 and its Corollary, observe that the correct scaling for the 
closest eigenvalue spacing is n _1 (n|/ n |) _ 3 . In this case, n _1 is comparable 
to the mean spacing and n\I n \ is comparable to the expected number of 
eigenvalues in the interval being considered. On the other hand, if the 
eigenvalues were independent and distributed according to a Poisson process, 
then the closest spacing would scale like n (n\I n \) , which is much closer. 
Thus we have another confirmation of what is often said: "The eigenvalues 
of a unitary matrix repel each other" . 

The universal unitary ensembles (UUE) are not completely standard. Given 
a real analytic potential V(x) with sufficient growth at infinity, UUE n is the 
ensemble of Hermitian matrices with the following joint probability density 
function (j.p.d.f.) for the matrix entries: 

(const) e - n ^ v{x ^dM 

Using Weyl integration (Appendix [T5|) . the j.p.d.f. for the eigenvalues is 
(const) Y[{Xi- Xj) 2 e- n ^ v ^dA 

i<j 

One could recover the Gaussian Unitary Ensemble (GUE) as a special case of 
UUE by choosing the potential V(x) = x 2 and rescaling properly. The word 
"unitary" in this context confuses many newcomers to the field of random 
matrices. The GUE is a probability distribution on the set of Hermitian 
matrices which is invariant under conjugation by any unitary matrix. 

Our main result, in the case of universal ensembles, is the following: 

Theorem 1.3 (Main Theorem, universal version). Let V(x) be a real ana- 
lytic potential which is regular and whose equilibrium measure ^(x)dx is sup- 
ported on a single interval [a, b] . Fix e > 0. For each n, let I n C [a + e, b — e] 
be an interval contained in the bulk distribution of eigenvalues, so that 
n\I n \ — > oo as n — > oo. 

Choose a matrix randomly from UUE n and let Z n be the closest spacing 
between any two eigenvalues whose average is in I n . Fix (3 > 0. Then as 
n — > oo, 

Pr (z n jf *( x ) 4 dz) 3 > ^ -+ e-?\ 



1.2. Overall Strategy. We analyze the random variable Z n indirectly. 
Let Gry t i tn count the number of eigenvalue pairs whose average is in I n and 
whose separation is at most 7. For example, if Ai < A2 < A3 < A4 < 
Ai + 7, then this contributes 6 pairs to G 7i / jn . For each 7 we analyze the 
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random variable G 7 / n in detail, but what interests us most is the probability 
Pr(G 7)J , n = 0): 

(Z n > 7) (G 7 , 7 , n = 0) 

Pr(Z„ > 7 ) = Pr(G 7)7 , n = 0) 

Since G 7) / ;n is an integer valued random variable, Pr(G 7i / jn = 0) is accessible 
from the moments of G 7i / in . The following Theorem says that the moments 
of G-y,i,n are approximately the moments of a Poisson distribution. 

Theorem 1.4 (Moment Estimation Theorem, universal version). Let V(x) 
be a real analytic potential which is regular and whose equilibrium measure 
ty(x)dx is supported on a single interval [a,b]. Fix e > 0. For each n, 
let I n C [a + e, b — e] be an interval contained in the bulk distribution of 
eigenvalues. 

For each n, let 7„ > 0. Let G lt i tn be the random variable which counts the 
number of UUE n eigenvalues whose average is in I n and whose difference 
is at most j n . Let G n be the Poisson distribution with mean 

9 Jin 

Then for all k > 1, as n — >■ oo, 

E{G\ hn ) = E(Gl) (l + O (V 1 + 7 2 n 2 + ( 7 2 n 2 ) 3 + (n|/|)"S)) . 
The constant implied by O depends only on k, the potential V(x), and e. 

Proving the Moment Estimation Theorem is the major task of this paper. 
First, let us assume the Moment Estimation Theorem and use it to prove 
the Main Theorem. 

1.3. Proof that the Moment Estimation Theorem implies the 
Main Theorem. In applying the Moment Estimation Theorem, we get to 
choose 7„. We choose: 

oc n 3\L n \ 3. 

With this choice of j n , fi n = /3 3 for all n. Applying the Moment Estimation 
Theorem, 

£(G 7)J , n ) = S(Gj 3 )(l + o((n|/ n |)-l)). 

By assumption, (n|7 n |) — > oo as n — > oo, so the moments of G 7) / jn converge 
to those of Gp3. Thus, 

Pr(G 7 ,j, n = 0) -»• Pr(G /3 3 = 0) = e~^ as n -> oo. 
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We retrace our steps back to the minimum spacing Z n . The random variable 
Z n is greater than 7 n if and only if GL j n = 0. Substituting in the chosen 
value of 7 n , this is equivalent to 

Zn {^rl q,{x)4(lx ) 3 > /3 ' 

completing the proof. 



1.4. Outline of Paper. In the remainder of this paper we prove the 
Moment Estimation Theorem. We first prove the case of CUE because this 
case is by far the easiest. The novel calculations for CUE are 

• The combinatorial methods used enumerate the various contribu- 
tions to the moment E(G~ t i tn ). Each contribution is expressed as 
an integral using the method of Gaudin. 

• Calculation of the main contributions using asymptotics for the 
projection kernel K n (x,y) and its derivatives. 

• Bounds for the undesired contributions, which are implied by bounds 
for K n (x,y) and its derivatives. 

For CUE, the projection kernel is very easy to work with: 

1 e in(x-y) _ i 

K n (x,y) = -, > . 

See Appendix 114.41 

We prove the UUE version of the Moment Estimation Theorem using exactly 
the same techniques. Parts of the proof are identical to the CUE case, and 
are omitted. The new feature the UUE case is that the projection kernel 
K n (x,y) is less easy to work with: 

n 

K n (x,y) = ^2vj{^)Vj(y) 

I .sVn(x)Vn-l(y) ~ Vn(y)Vn-l{x) 

= (const) . 

x-y 

where 4>j{x) is the (j')th normalized orthogonal polynomial with respect to 
the weight e~ nV ^ and r]j(x) = e~z v ( x >(j)j(x). 

The asymptotics for the orthogonal polynomials </> n -i and (f) n were obtained 
recently b y Deift, Kre icherbauer, McLaughlin, Venakides and Zhou. Based 
closely on [D KM+97] and [Dei99;, we outline the derivation of the leading 
order asymptotics. By a very minor modification of these techniques, we ob- 
tain the leading order asymptotics for the derivatives of r\ n -\ and r/ n - which, 
as one would expect, are the derivatives of the leading order asymptotics. 
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The case of the Gaussian unitary ensemble is of special interest. The analogs 
of the Main Theorem and the Moment Estimation Theorem could be ob- 
tained by rescaling the GUE and applying the universal results for the po- 
tential V(x) = x 2 . We present an alternate proof of the GUE Theorems, 
using Plancherel-Rotach asymptotics for Hermite polynomials in place of 
the more general Deift-Zhou asymptotics for orthogonal polynomials. 

Given our results about the closest spacing for eigenvalues in an interval, it 
is natural to ask about the distribution of the maximum spacing between 
consecutive eigenvalues in some interval. We have not studied this problem 
in great detail, but believe it is more difficult than the minimum spacing 
problem. At least we have not been able to deal with it yet. In section [T5l 
we outline one possible approach to studying the maximum spacing. 

We call the reader's attention to [Ede92j and to related papers available on 
Edelman's web page. In these paper Edelman finds the correct scaling and 
distribution of the condition number 

|Ai| ' 

where Ai is the complex eigenvalue with smallest absolute value. The con- 
dition number is an indicator of the difficulty of inverting a matrix numer- 
ically. Edelman considers the ensemble of matrices whose real or complex 
entries are chosen independently from Gaussians. Since the numerator is 
very tightly distributed because of the law of large numbers, Edelman's re- 
sult concerns the distribution of the smallest eigenvalue for a matrix in this 
ensemble. 



2. Case 1: The Circular Unitary Ensemble (CUE) 

We now state and prove the Moment Estimation Theorem for the circular 
unitary ensemble. The Moment Estimation Theorem for the CUE case 
implies the Main Theorem for the CUE case, which was already stated in 
the introduction. 

Theorem 2.1 (Moment Estimation Theorem, CUE version). For each n, 
let 7„ > and I n C [0, 2ir). 

Let G 7j / jn be the random variable which counts the number of CUE n eigen- 
values whose average is in I n and whose difference is at most 7 n . Let be 
the Poisson distribution with mean 

_ \ I n\lln A 

Mn ~ 1447T 2 

Then for all k > 1, as n — >• oo, 

E(G* >I>n ) = E(G*) (l + O (V 1 + 7 V + ( 7 V) 3 + (n|J|)-§)) . 
The constant implied by O depends only on k. 







Before proving this Theorem, we will show that it agrees with heuristic 
predictions. 



2.1. Heuristic Prediction Based on the Consecutive Spacing 
Distribution. We make the simplifying assumption that the consecutive 
spacings are independent random variables, all chosen from the consecutive 
spacing distribution for unitary ensembles. This assumption of independence 
is false, since adjacent consecutive spacings are anti-correlated. See p. Ill 
of [Meh9l] for a contour plot of the joint probability density function of 
two adjacent consecutive spacings. Observe that short spacings tend to 
be followed by longer ones and vice-versa. However, we will see that the 
independence assumption does lead to correct predictions for the minimum 
spacing. 

For the unitary ensembles, the consecutive spacing distribution vanishes to 
order two at the origin, so that very short consecutive spacings are unlikely 
and the eigenvalues are said to "repel." See [Meh9l] Chapter 5 and Appen- 
dix 13. For a power series expansion of this density function at the origin, 
the first few terms are: 

. vr 2 2 2vr 4 4 vr 6 6 
p{x) = -x -—x +—x 



When a matrix is chosen from U n , the mean spacing of the eigenvalues is the 
constant independent of 6. Thus a separation of 7 is equal to (%£:') times 
the mean spacing. When (^30 is small, the probability of any one of these 
spacings being less than 7 is about (^p) (^) 3 - The number of consecutive 

spacings is about so the expected number of consecutive spacings less 

than 7 is about 

= n\I n \ ( vr2\ /^7\ 3 _ \I n \n A ~il 
^~ 2vr V 9 / ^2vr/ ~~ 144vr 2 

Since GL i >n is a sum of many independent unlikely events, it is approxi- 
mately Poisson with mean [i. 



3. The First Moment of G 7) / . 



Recall that G~/ )n is the number of GUE-n eigenvalue pairs whose average is 
in /„, and whose difference is less than 7. Thus G 7) / )n is the symmetrization 
to n variables of a function of 2 variables: 

{- if \u- t\ < 7 
and ^ € / 
otherwise 

G-y,i,n(h,h, ■ ■ ■ ,t n ) = y~]g(tj,tj). 
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Using Theorem 114.51 the expected value of G 7) i >n is 



E(G"f,I,n) 



[0,2tt)2 



g(u,t) 



K n (u,u) K n (u,t) 
K n (t,u) K n (t,t) 



dudt 



K n (u,u) K n (u,t) 
K n (t,u) K n (t,t) 



dudt. 



In the above formula, the region O is the set where 52(^1^2) = 



n 



(u,t) s.t. 



U±t f- T 

2 

\u — t\ < 7 



The projection kernel K n (u,t) for U n is 

1 \ e in(u-t) _ ^ 
2^) e i{u-t) _ l 

K n (u - t) 



K n (u,t) 



Since (t — u) is small in the region 0, we change variables and expand the 
determinant as a power series. We let x = y = ^f--, and dxdy = ^dudt. 



In 



b 
b 



K n (0) K n (2y) 
K n (-2y) K n (0) 



dudt 



% 2 ((^(0)) 2 -K(0)i^"(0)) 

1 +0(y 4 coC4 + y 5 ciC4 + --- + 2/ 8 c 4 C4) y ' 

2 ' 

where Cj is an upper bound for the (j)th derivative of K n . We apply the 
following Lemma: 

Lemma 3.1. Consider the kernel j^r e e ixZ\ ■ F° r x > xq > ^(mod 2tt), the 
(k)th derivative of this kernel is 

o(-n k 

Without the restriction on x, the derivative is Q(n k+1 ) 



Proof. Consider taking k derivatives symbolically. After taking < 
j < k of these derivatives, the result is a sum of terms of the form: 

monomial(n, e inx , e lx ) 
(e ix - l) 1 ' 

At each step, the worst that can happen is that either the exponent in the 
denominator increases by 1, or the power of n in the numerator increases 
by 1. Since we assumed that xq > -, the worse of these two outcomes is 
multiplying by n. The result follows by induction, and observing that the 
number of terms is finite, depending only on k. 
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For the bound without the restriction on t, express the kernel as a sum of 
complex exponentials and differentiate termwise. □ 



Returning to our expression for E(F min ), 

Jin J-\~1 

= 4^1 /f 7 V(-^ + Y + O(n 3 ))^ + O(|I| 7 5 n 6 + 7V 



1447T 2 

This agrees with the heuristic predictions in Subsection 12,1 

4. The Higher Moments of G 7 / ra 

The (fc)th power of Gy > i >n (ti,t2, ... ,t n ) may be written 



k 

3 J 



j¥=3 



9(ti! , tji )g(ti a ,t h )... g(t ik , t jk ), 



where 

2 



g(u,t) = < 



( ! if \u — t\ < 7 
and **±£ G I 



otherwise 



The indexing set for the sum on the right is not of the form required for 
Theorem 114.51 However it may be written as a disjoint union of indexing 
sets of the proper form. 



4.1. The Combinatorics of Collapses. Let us, for example, parti- 
tion the indexing set i\ / / j2',h / 33 into smaller indexing sets of 
the proper form. We write this indexing set in shorthand as 



Nil) A (i 2 |j 2 ) A (i 3 |j3), 
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and expand as follows. Except for the subtle distinction between "|" and 
"tf\ each step below should be clear. 

(*i|ii) A (i 2 |j2) A (i 3 |j 3 ) 
= (h\h) {{n\h^\h) v {i2,h\32\tt) v {h,h\j2,h) v {i2,h\h\h) V {i2,h\j2,h)) 
= {{{ix\h^h\32\\h\h) v (*i I Ji , *2 b'3 ) v (ii|ii,j2 1*2^3^3) 

V (*2 1 Jl , *3 b"3 1)*2 1 J2 ) V («1 1 J 1 , J*3 ] »3 1]*2 1 J2 ) ) 

V {{il,h\h\h^h\h) V («1 , *2 1 Jl , J2 1]*3 1 J3 ) v {il,i2\h,h\j2\h) v {il^jl, h^h)) 

v ((n,j2|ji|«2^3|j3) v (n,j2|j'i,«2^3b3) v (ii,32\ji,h\i2\h) v (n, J2b'l, h\l2\h)) 

V {{h,h\h\h\i2\h) V (H^3b'l,«2b'2b'3) V (il,l3b'l, J2K2IJ3) V (*1 , *3 1 Jl , J3 ll*2 b"2)) 

V ((«i , J3 b'i 1*3 ll*2 b'2) V (il,i3bl,*2b"2|*3) V (h , h \jl , J2 \h \h) V O'l , J3 1 jl , hW\ 32))) 

V (13 terms) V (7 terms) V (13 terms) 

V ((il|il^2,j3|j2,«3) V (i\\j\,h,h\32,h) V (ilb'l, J2,«3|«2, j'3) v (ii,i3,j3b'ib"2,i3) 

V(il, i 2 , J3|j'l, J2, ^3) V (il, j2,*3b'l 1*2, J3) V (n,i2,«3|jl,«2,j3)) 

Each term above is an indexing set of the form required for Theorem 114.51 
We call such indexing sets "collapses." Let us take one of the more compli- 
cated collapses above and explain it's meaning: 

b'i , *3 b"3t;*2 b*2) 

This means that j\ = 13, but otherwise the indices h,ji, 12, 32-, 33 are 
distinct. There are 53 indexing sets in the full expansion, but we only list 
28 of them above. The symbols "|" and "t|" both separate the i's and j's into 
equivalence classes. The symbol \ plays an additional role which will not 
be clear until later in this section. If i\ and ji are in different equivalence 
classes, then we say that these two equivalence classes fall into the same 
cluster. The clusters are separated by \. 

Every cluster contains at least two equivalence classes. If any cluster in 
a collapse contains more than two equivalence classes, we call the collapse 
a mixed collapse. If every cluster contains exactly two equivalence classes 
we call it a clean collapse. We will show that the mixed collapses make a 
negligible contribution to E(G^ In ). 

There are 42 mixed collapses for k = 3, which we do not list explicitly. 
The 11 clean collapses for k = 3 are: 

[(nbiNj2Nj3)] 

[(n,«2b'l, J2teb3) V J2|il,«2^3b"3)] 
[(*1 , *3 bl , J3 ll*2 |j2 ) V (n,j3|jl,j3^2|j2)] 

[(n|j'i^2,i3|j2,«3) v {i\\3i\\i2yk\hih)\ 

{h,h,h\j\,32,h) V {i\,32,h\3iiiz,3z) 
V (ii, 12, 32J3) V {ii,32,h\h,i2,k) 

In the list above, we have collected those indexing sets together so that each 
collection corresponds to one way of partitioning {1, 2, 3}. 



(1|2|3) 
(12|3) 
(13|2) 
(1|23) 

(123) 
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The following Lemma is evident, based on consideration of the expansion of 
(h\ji) A (^2 1 J2) A (23IJ3) into collapses which we considered above. 

Lemma 4.1. When the indexing set (*i |ji ) A (I2 1 J2) A • • -A(ik\jk) is expanded 
into collapses, every cluster contains at least two equivalence classes. Clean 
collapses with I clusters arise from a partition of the integers {1, 2, . . . , k} 
into I nonempty subsets. Each of I subsets of {1,2 ... ,k} leads to a cluster of 
equivalence classes of {i\, ji, 12, J2, • • • > ik,jk}- Each partition of {1, 2, . . . , k} 
into I equivalence classes corresponds to 2 k ~ 1 different clean collapses with 
I clusters. 

We will see that clean collapses make the dominant contribution to E(G k j n ). 
For each clean collapse, the "block diagonal" id the dominant contribution. 
Using a combinatorial analysis, we'll see that the sum over all clean collapse 
block diagonal terms is approximately equal to E(G k ), in agreement with 
Theorem E3J 



4.2. Estimating the clean collapses. As an example, we select one of 
the clean collapses from l n , use Theorem ll4.5l to express its contribution 
to E(G^ j n ) as an integral, and estimate that integral. We select: 

(n,j2|jl,^2^3|j3)- 

Making the substitutions ji = i\ and ii = ji, this indexing set makes the 
following contribution to E(G^ j n ): 

k,ji,k,j3 
distinct 

Since g takes only the values or ^ and is symmetric, g(ti 1 ,tj 1 )g(tj 1 ,ti 1 ) = 
2"S(*ii)*ji)- The contribution to E(G^ l n ) simplifies to: 

2 Yl 9(th,tji)g{ti 3 ,tj 3 ) 

k,ji,i3,j3 
distinct 

This is the symmetrization of a function of four variables, so Theorem 114.51 
expresses the expected value of this sum as an integral involving four vari- 
ables: 
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E 



\ 



V 



£ 

distinct 



J 



1 

2 J J4 

Notice that git^ , tj 1 )g(ti 3 , tj 3 



g(ui,h)g(u 3 ,t 3 ) [4 x 4] duidtidu 3 dt 3 



takes only the values and i. Instead of 



including g(ti 1 ,tj 1 )g(ti 3 ,tj. i ) in the integrand, we incorporate the constant \ 
and restrict the region of integration: 

if n (ui,ui) K n (ui,ti) K n (ui,u 3 ) 



1 



K n (ti,ui) 
K n {t 3 ,ui) 



K n (u 3l ti) 



K n (ti,u 3 ) 
K n (u 3 ,u 3 ) 
K n (t 3 ,u 3 ) 



K n {ui,t 3 ) 
K n (t u t 3 ) 
K n (u 3 ,t 3 ) 
K n (t 3 ,t 3 ) 



du\dt\du 3 dt 3 



^ — ^-T, (i 1 ,J2 |jl ,»2 11*3 1 J3 ) ~~ * 



(ui,ii,u 3 ,i 3 ) 



pi — t\\ < 7 

«3+k g J 



1^3 - *3| < 7 

We see for the first time the meaning of clustering the equivalence classes. 
The variables u±,ti, u 3 ,t 3 each represent one of the equivalence classes in 
{h,32\ji, ^2^3 bs)- The equivalence classes corresponding to u\ and t\ are 
in the same cluster and so the region of integration 0, carries the restriction 
that |ui — ii| is small. Similarly, \u 3 — t 3 \ is small because the equivalence 
classes i 3 and j 3 are in the same cluster. 



4.3. The main contribution from the clean collapses. Of the 24 = 

4! terms in the determinant, the ones which make a significant contribution 
to the integral are the four terms on the block diagonal. Their contribution 
is: 



1 



K n {ui,ui) 
K n {ti,ui) 







-E(G y j >n y 



I4|n 4 7 3 

1447T 2 



K n (ui,ti) 









K n (u 3 ,u 3 ) 
K n (t 3 ,u 3 ) 






K n (u 3 ,t 3 ) 
K n {t 3 ,h) 



du\dt\du 3 dt 3 



K n (u,u) 
K n (t,u) 



K n (u,t) 

K n {t,t) 



dudt 



1 + O - + n z Y + (n z j 
n 



2„,2\3 



For a moment, let us take for granted that the block-diagonal terms of the 
clean collapses are the dominant contributions to E(G^ 7 n ). Using identical 
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techniques to the ones above, the block diagonal terms of a clean collapse 
of A (I2U2) A • • • A (ik\jk) into I clusters contributes 

k-l 

According to Lemma 14. 1\ for every partition of {1,2, ... ,k} into I equiva- 
lence classes there correspond 2 k ~ l clean collapses with I clusters. Thus each 
partition of {1,2,..., k} into I equivalence classes contributes E(G^j :n ) 1 to 
the expected value E(G k 2 n ). 

In the following Subsection, we show that each partition of {1, 2, . . . ,k} con- 
tributes fJ to the moment E(G k ). Taking for granted that the block diagonal 
terms of clean collapses are the main contribution and expecting other error 
terms to sneak in, we have recovered the conclusions of Theorem 12.11 

4.4. Moments of the Poisson Distribution. The moments of a 
Poisson distribution are given by polynomials in the first moment with non- 
negative integer coefficients: 

E(G Ji i in ) = /i 

E(Gl Iin ) = /i + /x 2 

E{G\ I)n ) = ^ + 3^ + ^ 

E(G* tI>n ) = fj. + 7/j 2 + 6/x 3 + // 

i 

i=i 

Lemma 4.2. The an- satisfy the following recurrence relation and initial 
conditions: 

01,1 = 1 

ai,k = for k^l 

a-i,k = + ai-i,k-l for i > 1. 

The CLij have a combinatorial description: a^^ is the number of ways of 

partitioning {1, 2, 3, . . . , i} into k nonempty subsets. 

Proof. Define the a^fc by the stated recurrence relation and initial con- 
ditions. 

Suppose that X is Poisson with mean [i. Then 

k 

Pr(X = k) = ^-e - ". 

k\ 

Using the above formula one sees that 

fjP = E (X(X -l)(X-l)...(X-j + 1)) 
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Let the coefficients V % be the result of expanding the above in terms of 
moments: 

= XV£(Gv >n ) 

i 

The W l are specified by the following recursion and initial conditions: 

= 1 

= for j ^ 1 
6?'.* = y'-M-i _ (j _ ljy'- 1 .* 

The following is true for fe = l,j > 1, and for > 1, j = 1: 

i 

Assume for the purposes of induction that the above is true for all smaller 
values of j and k. The following symbolic manipulations, which use the 
Einstein summation convention, comprise the inductive step. 



V'oi.k = - (j - l)^" 1 '*) a i>k (fcai-i,* + Oi_i >fc _i) 

= (fcoi-i.fc + ai_i )fc _i) - (j - l)^ -1 '*^,* 

= V- 1 '*- 1 *-!,*-! + kV-^Oi-^k - (j - l)<5f 1 

= ci + ^r 1 - u - i)c 1 

= Cl 

= ^ 

We have now shown that 

k 

J2b k ' i a i , j = S(j = k) 
i=i 

for all k > 1. This equation uniquely determines the ajj. This equation is 
satisfied if and only if 

fi k = ^2 b k,i mi 

= E(X(X -l)(X -l)...(X -k + 1)), 

where m; = X^}=i a ijtf ■ ^ 

5. The Errors EfTecting Higher Moments of G 7 j jn 

There are two types of contributions to E(G^ 1 n ) which we have not yet 
considered. We have yet to consider the off block diagonal terms of the 
clean collapses, and the mixed collapses. Since the block diagonal terms 
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of clean collapses contribute approximately E(G^), which is what we want, 
these remaining terms are "errors" . Our main tool in bounding these errors 
will be Lemma 13. 11 



5.1. Controlling the Off Block Diagonal Terms. We now estimate 
the off block diagonal terms. For this, we perform row and column opera- 
tions on the matrix which leave the determinant unchanged. Specifically, we 
subtract each odd column from the following even column, and subtract each 
odd row from the following even row. In addition to leaving the determinant 
unchanged, this operation does not effect which terms in the determinant 
are on the block diagonal and which are off block diagonal. 



duidtidu 3 dt 3 



K n (u\,t\) K n (ui,u 3 ) K n (ui,t 3 ) 

K n (ti,ui) ^ K n (h,u 3 ) K n (tx,t 3 ) 
K n (u 3 ,ui) K n (u 3 ,ti) ^ K n (u 3 ,t 3 ) 

_ K n (t 3 ,ui) K n (t 3 ,tx) K n (t 3 ,u 3 ) £ 

K n {t llUl ) - £ % - K n { Ul ,h) - K n (t uUl ) 

K n (u 3 ,ui) K n (u 3 ,ti) - K n (u 3 ,ui) 

K{t 3 , Ul ) - K{u 3 , ui) K(t 3 , h) + K(u 3 , ui) - K{u 3 , i x ) - K(t 3 



K n (ui,u 3 ) 
K(ti,u 3 ) - K(ui,u 3 ) 



2?r 



K n (t 3 ,u 3 )-£ 



K n (ui,t 3 ) - K n (ui,u 3 ) 
K(h,t 3 ) + K(ui,u 3 ) - K{ui,t 3 ) - K(ti,u 3 ) 
K n (u 3 ,t 3 )-% 
|f - K n (u 3 ,t 3 ) - K n (t 3 ,u 3 ) 



du 



To estimate the individual entries in the matrix in the integrand, we use 
Lemma |3. II in conjunction with a Taylor series expansion of K n . 



5.2. Dividing ft into Two Regions. We first divide the region £1 into 
two regions. In the first region £l a , at least one pair of the variables ui and 
u 3 are closer than a, where a » (-). In the latter region , no pair of 
variables among ui and u 3 are closer than a: 



a 



a- 



(m,ti,u 3 ,t 3 ) 



(m,ti,u 3 ,t 3 ) 



ttl+tl c J 

2 ^ 1 

\ui — ti\ < 7 

U3 + tz ,- T 

2 I 
\u 3 -h\ < 7 

1^1 — u 3 \ < a 
ui±h c j 

\ui — ti\ < 7 

, 2 = 1 

m - 1 3 \ < 7 

\ u i — u 3 \ > a 
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The size of the region Q is 0(|/|a 7 2 ). Using Lemma 13.11 in conjunction 
with the power series expansions of K n , the entries in the matrix are: 



1 



O(n) 
0(ra 2 7 ) 

O(n) 
0(ra 2 7 ) 



0(ra 2 7 ) 

0(nV) 
0(ra 2 7 ) 

O(nV) 



duidtidusdts 



O(n) 
0(ra 2 7 ) 

O(n) 
0(n 2 7 ) 

Each term in the determinant, and hence the determinant itself, is 0(n 8 7 ). 
Multiplying this by the size of the region of integration yields the total 
contribution of 0(n 8 |/|a 7 6 ) of off block diagonal terms in the region Q a . 



0(n 2 7 ) 
0(n 3 7 2 ) 
0(n 2 7 ) 
0(n 3 7 2 ) 



The size of the region is 0(|I| 2 7 2 ). Again using Lemma 13.11 the terms 
in the matrix are: 

Ofi) 



1 



0(n) 0(n 2 7 ) 
0(n 2 7 ) 0(n 3 7 2 ) 

O(^) O(^) 



O(n) 0(n 2 7 ) 
0(n 2 7 ) 0(n 3 7 2 ) 



duidtidusdtz 



The off block terms in the determinant above are 0( n 1 )■ Multiplying this 
by the size of the region of integration yields 0( " 7 ). 



Now we wish to choose a to minimize the total error from the off block 
diagonal terms. We use the method of dominant balance to choose a = 

5 ) so that the two contributions from Q a and Qq, are comparable. The 
resulting contribution from the off block diagonal terms is 



o 



\I\n 



Let us generalize the above example to other clean collapses. Consider a 
clean collapse of (h\ji) A (I2U2) A ••• A (ik\jk) into I clusters. Using the 
same techniques as above, the contribution from Q a is 0(re 4 '|/|' _1 a 7 3 '). 

The contribution from is O ^(^) 2 (|^l n4 7 3 )'^ • Using the method of 



dominant balance, we again choose a 



The total contribution of 



the off-block-diagonal terms to a clean collapse with I clusters is 

1 



O 



\I\n 



(nVl7 3 )' 



5.3. Controlling the Mixed Collapses. Let us consider one of the 
mixed collapses from G 3 1 n : 

(*1 , , *3 1 Jl I *2 , J3) ■ 
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The contribution to E(G^ 1 n ) from this mixed collapse may be written as 
an integral: 



E 



^2 g{th,t h )g(t i2 ,t h )g(t hj t i2 



\ distinct 



g(u 1 ,t 1 )g(u 2 ,u 1 )g(u 1 ,u 2 ) 



K n (ui,ui) K n (ui,h) K n (u 1 ,u 2 ) 1 
K n (t 1 ,ui) K n [ti,ti) K n (ti,u 2 ) du 1 dt 1 du 2 
K n {u 2 ,ui) K n (u 2 ,ti) K n (u 2 ,u 2 ) _ 

K n (ui,ui) K n (ui,ti) K n (ui,u 2 ) 
K n (ti,ui) K n (ti,ti) K n (t 1 ,u 2 ) duidtidu 2 , 
K n (u 2 ,ui) K n (u 2 ,ti) K n (u 2 ,u 2 ) 



where is the region: 



(ui,ti,u 2 ) 



2 I 

\U\ — ti| < 7 

»l+"2 a T 
2 I 

\u\ - u 2 \ < 7 



As before, we perform row and column operations. For each cluster, one 
of the equivalence classes is selected; in this case the equivalence class rep- 
resented by the variable u\. The u\ column is then subtracted from the 
columns corresponding to the other equivalence classes in its cluster. Sim- 
ilarly, the u\ row is subtracted from the others equivalence classes in its 
cluster. We then use Lemma 13. II to estimate the terms in the matrix. 

K n (m,ui) K n (ui,ti) - K n (ui,ui 

K{ti, Ul ) - K( Ul , Ul ) K(t Xl t x ) + K{ux,ux) - K{u x ,t x ) 
K(u 2 , iti) - K(ux,ux) K(u 2 ,ti) + K(ux,ux) - K(u\,t\) 

K n (ux,ux) 

K n (ux,u 2 ) - K n (tx,ux) 
K n (ux,u 2 ) - K n (u 2 ,ui) _ 

0(n 2 7 ) 



1 



K(tx,ux) 
K(u 2 ,ux) 



1 



K n {ui,u 2 ) 
K n (tx,u 2 ) + K n (ux,ux) 
K n (u 2 ,u 2 ) + K n (ux,ux) 

0(n 2 7 ) 



duidt\du 2 



O(n) 

0(n 2 7 ) O(nV) O(raV) 
0(n 2 7 ) 0(n 3 7 2 ) 0(n 3 7 2 ) 



du\dt\du 2 . 



The size of each term in the determinant is 0(n 7 7 4 ). The size of the region 
of integration is 0(|/|7 2 ), so the total contribution from this mixed collapse 
is 0(|/|n 7 7 6 ). 



To generalize, consider a mixed collapse of (*i A (I2U2) A - • • /\{ik\jk) with 
lx equivalence classes and l 2 clusters. The size of the region of integration is 
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0(|i"|V 1- ' 2 )- The size of each term in the determinant is 0(n ll (n^) 2 ^ 11 l ' 2 ^). 
The total contribution is 



0(|I|'2 7 3«i-3*2 n 3«i-2J 2 ) 
,h-l2 

O 



(n\I\)h-th J ' 

where 

1/1 n 4 7 3 

a = . 

P 2vr 72vr 



The mixed collapses of A (I2IJ2) A ■ ■ ■ A (ik\jk) have the constraints 

2l 2 <h< 2k. Thus /1 - 2l 2 > 1. Since h > 2, 

ii-i2 = ^i + ^(ii-2i 2 )>|>l. 

Considering the expansion of (n|ji) A ( 1 2 1 J2 ) A • • • A (ifclifc), one sees that 
h — h < Since 

1 < ii — i 2 < fe, 

the main contribution to E(G^ 1 n ) contains contributions comparable to 
fi ll ~ l2 . Relative to the main term, the contribution from this, or any, mixed 
collapse is 




6. Review of Case 1: The Circular Unitary Ensemble 



There are several sources of error in the estimation of E(G* In ). We sum- 
marize these sources in a table. In this table, /i = ^"J? 1 . 



description 


contribution 


relative size 


main term: 

block diagonal clean collapse 


EjLi a k,j^ 


1 


approximation E^G^j^) \i 
used to estimate main term 


H l O (n- 1 + n 2 7 2 + (n 2 7 2 ) 3 ) 


O (n" 1 + n 2 7 2 + (n 2 7 2 ) 3 ) 


clean collapse with I clusters, 
off-block diagonal terms O a 


O {n Al I l - l a^ 1 ) 


O(f) 


clean collapse with / clusters, 
off-block diagonal terms £la 


o((^) 2 (W)') 




clean collapse with I clusters, 
total of off-block diagonal terms, 

choosing a = (^) 3 


o((^) i (/nV) / ) 




mixed collapse, l\ equivalence 
classes, and l 2 clusters 
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7. Case 2: Universal Unitary Ensembles 



Let V(x) be a potential which is real analytic on 1Z and has sufficient growth 
at oo: 

hm - — — = = oo. 

|x|-*oo iog(x z + 1) 



The universal unitary ensemble UUE n , with potential V(x), is the set of 
nxn Hermitian matrices M = (my) with joint probability density function 

(const) e~ n ^ v{x ^dM 

The joint probability density function for the eigenvalues is obtained from 
the joint probability density function for the matrix entries using Weyl in- 
tegration. See Appendix [13l 

(const) Y[(Xi- Xj) 2 e- n ^ v ^dA 

i<j 



As discussed in Section 111.11 we assume that the potential is regular and 
that the equilibrium measure ^(x)dx is supported on a single interval [a, b]. 



7.1. Heuristic Prediction for the Universal Ensemble. As in the 

case of CUE, we again make the simplifying but false assumption that the 
consecutive spacings are independent random variables. Thus j^, as 
a sum of many unlikely events, will be approximately Poisson. We now 
estimate its mean. 

At a point x £ I, the local density of eigenvalues is given by the diagonal of 
the projection kernel: 

density = K n (x, x). 

At the point x, a separation of 7 is equal to jK n (x, x) times the local mean 
spacing. Thus the probability of any one of these spacings being less than 
7 is approximately 



In the region [x,x + dx], the number of consecutive spacings is approxi- 
mately K n (x,x)dx. Integrating over x £ I, the expected value of G 7j / jn is 
approximately: 



E(G 7 j, n ) — — J K n (x,x) 4 dx. 



For x G [ a + e,b-e], K n (x,x) = n^{x){l + 0( n - 1 )). See [DKM^97j. Thus, 



the heuristic prediction agrees with the Moment Estimation Theorem. 
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8. The First Moment in the Universal Case 



As in the case of CUE, G 7j / jn is the symmetrization of a function of 2 
variables: 



92(x,y) 



x — y\ < 7 
otherwise 



5 ^ 



Gy,I,n (*1 , t2, ■ ■ ■ , t n ) — ^2 g% (ti ,tj). 

i+3 



Using the method of Gaudin (see Appendix [Tl|) . the expected value of Gy t i >n 



is 



E[G 1>I>n ) = j J J 2 {u,t) 



K n (u,u) K n (u,t) 
K n (t,u) K n {t,t) 



dudt 



K n (u,u) K n (u,t) 
K n (t,u) K n (t,t) 



dudt. 



In the above formula, the region is the set where 32(^1,^2) = \'- 



n 



(u,t) s.t. 



u+t f- j 

2 1 
\t — U\ < 7 



The projection kernel K n (x,y) is defined in Section [T41 

Compared to the CUE n case, it is now slightly more difficult to estimate 
E(Gyj t n) because the kernel K n (x,y) no longer depends solely on the dif- 
ference (y — x). 

The region f2 is narrow in the (t — u) direction. This suggests expanding the 
integrand in a Taylor series. Let x = and y = Then dxdy = ^dudt. 
In terms of the new variables x and y, the first moment is: 



E(G^yj :n ) 



K n (x + y,x + y) K n (x + y,x - y) 
K n (x -y,x + y) K n (x-y,x-y) 



dxdy 



2 7 K n (x + y,x + y)K n (x-y,x-y) 
-K n (x + y,x -y)K n (x -y,x + y) 

2 ' 



dxdy 



+O(y 4 c c 4 + y 5 cic 4 H h y 8 c 4 c 4 ) 



dxdy, 
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where 



and Cj is the maximum over Q, of any (j')th partial derivative of K n (x,y). 
Using the Lemma [l2.2l the integral of 0(y 4 coC4) over Q is O (|/ n |7^n 6 ) , the 
integral of 0{y b c\Ci) is 0(|/ n |7 6 n 7 ), and so on, and the integral of 0(y 8 C4C4) 
is 0(|/ n |7 9 n 10 ). Dropping the redundant error terms, 



+0(|/ n | 7 5 n 6 + |/ n | 7 V ). 



We first evaluate the derivatives , ^ ', and symbolically as a 

function of x and y. Let 



[i,j,k] 



y) h 



where rjj(x) = (j)j(x)e 2 V ( X \ and (f)j is the (j')th normalized orthogonal 
polynomial with respect to e~ nV ( x 'dx. In terms of this new notation, we 



24 

have the following expressions for K(x, y) and its derivatives: 



dx 
d_ 

dy 
dx 2 

d 2 

dxdy 

d 2 



K = 
K = 



K = 



K 



K 



dy' 



K = 



[0,0,1] 

[1,0, 1] -[0,0,2] 
[0,1,1] + [0,0, 2] 

[2,0,l]-2[l,0,2]+2[0,0,3] 

[1,1,1] -[0,1, 2] + [1,0,2] -2[0, 0,3] 

[0,2,l]+2[0,l,2]+2[0,0,3] 
d d 



dx dy} 
[1,0, 1] + [0,1,1] 

( d 2 | 2 d 2 | d 2 \ R 

\ dx 2 dxdy dy 2 J 
[2,0,1] +2[1, 1,1] + [0,2,1] 

d 2 d 2 d 2 



+ 2- 



+ 



K 



dx 2 dxdy dy 2 1 
([2, 0, 1] - 2[1, 1, 1] + [0, 2, 1]) + (-4[1, 0, 2] + 4[0, 1, 2]) + 8[0, 0, 3]. 



We wish to evaluate the above quantities K^, /*, and along the 

diagonal x = y. At first we are taken aback by the presence of (x — y) in the 
denominator, but then recall that K n (x,y) is smooth everywhere, including 
the diagonal. We determine the limiting values along the diagonal by use of 
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L'Hopital's rule, which in the case of must be applied three times: 



K^(x,x) = [200] + [110] 
RSS( x , x) = [300] + 2[210] + [120] 
K^\x,x) = (x-y) 2 ([2,0,3] - 2[1, 1, 3] + [0, 2,3]) 

+(x - y)(-4[l, 0, 3] + 4[0, 1, 3]) + 8[0, 0, 3] 

= ^((x-y) 2 ([3,0,2]-2[2,l,2] + [l,2,2]) 

+2(x-y)([2,0,2]-2[l,l,2] + [0,2,2]) 
+(x-y)(-4[2,0,2] +4[1,1,2]) 

+(-4[l,0,2]+4[0,l,2])+8[l,0,2]) 

= i([3,0,0]-2[2,l,0] + [l,2,0]) 

+1 ({x - y)(-2[2, 0, 2] + 2[0, 2, 2]) + (4[1, 0, 2] + 4[0, 1, 2])) 

= ^([3,0,0]-2[2,l,0] + [l,2,0]) 

+^((x-y)(-2[3,0,l]+2[l,2,l]) 
+(-2[2,0,l] + 2[0,2,l]) 
+(4[2,0, 1]+4[1,1,1])) 

= Q[2,l,0] + |[l,2,0]) + Q[2, 0,1] + |[1,1,1] + ^[0,2,1] 

= Q[2, 1, 0] + |[1,2,0]^ +(^[3, 0,0] + |[2,1,0] + ^[1,2,0]) 

= i[3, 0,0] + [1,2,0]. 



and substitute the results into the integrand: 



(kk^s - KK>^ - (i^) 2 ) 

[1, 0, 0] ( [3, 0, 0] + [2, 1,0])- [1, 0, 0] Q [3, 0, 0] - [2, 1,0])- [2, 0, 0] 2 

^[1,0,0][3,0,0]+2[1,0,0][2,1,0] -[2,0,0] 2 
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We derive estimates for [3, 0, 0] and then state without proof the analogous 
estimates for [1,0,0], [2,1,0], and [2,0,0]. 



[3,0,0] 



(b-a) 



O 



(3) 



(b — a)\ . T , . . o 2 



am 



^(-ie ie ^u{x))^{e ww v{x)) - ^{e M{x) u{x))^{-ie M ^v{x)) + O ( - 



J9(x) 



iO(x). 



n 



(mr^(x)Y 
2^ 

(mrfy(x)y 
2^ 

(mr^(x)y 
2^ 



b — x 



+ 



x — a 



sin(0 + a) cos(6 + 13)- cos(6» + a) sin(6> + /3) + O 
sin(a -0) + Oin- 1 )) 



x — a 



b — x 



b — x \x — a 
x — a V b — x 



7T 



u(x) 
v(x) 



+ o(n- 1 ; 



where 



9(x) = 

u(x) = 

v(x) = 

a(x) = 

P{x) = 



nir 



J x 



x 



x — a 



+ e" 



e * 



b — x 



arg(u(x)) 
&rg(v(x)). 



x — a 



+ e 



b — x 

x — a 
b — x 



Using the same techniques, 

[2,1,0] = (Sl!W£(-l + 0(»-')) 

7T 

[2,0,0] = ^^(O + OCn- 1 )) 
[1,0,0] = (!^M(-l + 0(n-)). 
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Returning to our expression for E{G 1j i jV ), 

-L (kkSS _ KK \\ _ (RSf) dx 

= O(|/| 7 5 n 6 ,|/| 7 V ) 

+ J±&Wl + a(-i)<-i) - + o(»-)) d. 

= 0(|/| 7 5 n 6 + |J|tV ) + / ^"''^M 4 (1 + ofn" 1 )) <tt 

= [ V(x) 4 dx (1 + O (n- 1 + 7 V + ( 7 V) 3 )) . 

9 Ji 

These calculations confirm that, for the first moment S(G 7j / jn ), the conclu- 
sions of Theorem 11.41 are correct. We now estimate the higher moments. 

9. Higher Moments in the Universal Case 

As in the case of a random unitary matrix, we express E(G~,j tn ) as a sum 
of several terms, with each term corresponding to a collapse of 

(k\h) A O2U2) A (i 3 \j 3 ). 

Using the method of Gaudin, each term is expressed as an integral involving 
only a few variables. As before, the main contribution will come from the 
block-diagonal terms of the clean collapses: 

k 

(clean collapse, block diagonal) = akjE^G^j^y 

j=l 
k 

= ^a.kjrf (1 + 0(n _ Vn)) . 
3=1 

where 

The errors in this approximation have the same sources as in the CUE case: 

• the approximation E(G~ i n ) = /J, used in estimating the main term. 

• clean collapse, off-block diagonal terms. 

• mixed collapses. 



The bounds for these errors are obtained using exactly the same reasoning 
as in the CUE case. The table from the CUE case applies here with only 
minor changes. The only differences are that 
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• The value of [i has changed. 

• We now use Lemma llO.ll instead of Lemma 13. II to bound K n (x,y) 
and its derivatives. 



10. Controlling the Derivatives of K n (x,y) in the Universal Case 

For bounding the error terms for higher moments in the UUE case, we 
require estimates for K n (X,y) and its derivatives. In order to apply the 
Christoffel-Darboux formula from Lemma [bOI we approximate the coeffi- 
cient b n -\. We refer the reader to [D KM + 97] and state the result and error 
term without proof. This approximation is valid when the potential V(x) is 
regular and the equilibrium measure is supported on a single interval [a, b]. 



b n -i = I xcj) n {x)4> n - 1 {x)e nV(x) dx 
xrj n (x)rj n -x(x)dx 
J xr] n (x)r] n ^ 1 (x)dx + O 



-oo 
oo 



— oo 
6 



b 2 ( l(b-x\% 1 fx-a\^\ /l 
X (b^{-2{—a) + 2{b^) ) dX + {n 

n 



The projection kernel is 

Vn{x)r]n-i{y) - r] n (y)r] n -i(x) 



K n (x,y) = b n . 



l- 



x-y 

Lemma 10.1. Suppose that the potential V{x) is regular and that it's equi- 
librium measure ^{x)dx is supported on the interval [a, b}. Fix e > and let 
R be the region 

[a + e, b — e] x [a + e, b — e] . 
For (x,y) G R, a mixed partial derivative of K n (x,y) of total degree k > 
is O (n fe+1 ) . When \x — y\ > -, we have the stronger bound O f \ x - y \ nk 



Proof. Take k partial derivatives of K n (x,y) symbolically. The result 
is a finite number of terms of the form 

. ^\ x )nt\{y) - v { n 2 \v)rit\(x) 

On-lCkxMM ^ x _ y\i+k 3 

where k\ + k2 + k% = k, the coefficients Ck lt k 2 ,k 3 are integers which do not 
depend on n, and a superscript in parenthesis indicates multiple differenti- 
ation. 
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We divide the region R into two regions: 

Rdiag = l(x,y) € R \x - y\ < ^ | 

Rbulk = \(x,y) £ R 



1 

\x — y\> — 

n 



In the region Rbulk, rfh l \x) and rj^}_\{x) are of size 0(n kl ); iyh 2 \y) and 
V^n-liv) are °f s i ze 0(n fc2 ); and ^_^i+fc 3 is of size O (^j^z^n h3 ^j . The total 



contribution in the bulk region is then 

O ( r-^n k 
\x - y\ 



The region Rdiag is more subtle. Consider the term 



(x-y) 



l+k 3 



We treat both ?y„ 1 {x)rjl l 2 l (y) and ry„ 2 (y)r/„i 1 (x) the same way, so let us 

discuss f]^ 1 \x)r]^l(y) ■ We expand rj^\(y) in a Taylor series centered at 
x. 



for some x between x and y. Selecting the remainder term yields 

Vn {X) ( fc3+1 )i Vn-1 \ X ) 

(y - x) 1+fc 3 
= 0(n fcl+fe2+fca+1 ) = 0(n 1+k ), 

which is within the desired bound. Selecting one of the other terms in the 
expansion of r)^\(y) yields 

(y - x) 1+k, i 

(y - x) 1+fc 3-i ' 
for some < j < k^. Observe that this term is of the form 



,-nV(x)_ 



p(x) 



(y - x) 1+k 3-3 ' 



where p(x) is a polynomial in x, and the exponent 1 + k% — j in the denom- 
inator is positive. Upon adding all such terms in the Taylor expansion, for 
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every term in the symbolic differentiation of K n (x,y), the result is 

fc3 I \ 

E e -nV{x) Pj{ x ) 
(y - x) 1+fc 3-J ' 
3=0 yy ' 

Since K n (x,y) = Y^l=o Vi( x )Vi(y)> it is smooth along the diagonal x = y 
and Pj(x) is identically zero for < j < k$. □ 



11. Asymptotics for Orthogonal Polynomials with General 

Weights 

We outline the derivation of the leading order asymptotics for the rj n (x) and 
■q n -i(x), where 

_ 4>j{x) 

and 4>j(x) is the (j)th normalized orthogonal polynomial with respect to the 
measure e~ nV ( x \ 



Our derivation follows the exposition of |DKM + 97 and |Dei99j very closely 



and omits the proofs of several facts which ar e proven in th ese a rticles. The 
strongest results in this direction are proven in [DKM±99aJ and |DKM+99b] , 
The only departure from their presentation comes when we obtain bounds 
for the derivatives of L(z), and then use these bounds to derive the leading 
order asymptotics for the derivatives of rj n (x) and r] n -i(x). 

The leading order asymptotics of the derivatives of r\ n and r? n _i turn out to 
be the derivatives of the leading order asymptotics. Of course one expects 
this, but it is not something which can be taken for granted. 



11.1. Facts About the Equilibrium Measure of a Potential. Let 

V(x) be a potential which is real analytic on 1Z and has sufficient growth at 
oo: 

lim - — = 00 

\x\^oo log{X z + 1) 

This growth condition guarantees, in particular, that all the moments of 

e -nV(x) are fi n it e _ 

Given a probability measure \x on K, let I v be the energy functional: 

I V (H)= I I log\t- s\- 1 dfi(t)dfi(s)+ [ V(t)dn(t) 
Jn J it Jk 

Theorem 11.1. Let V and I v be as above. Then there exists a unique 
probability measure \i = yY such that 

E v = inf I v (y)=I v (y v ), 

/LteM 1 
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where the infimum is over all probability measures on 1Z. The equilibrium 
measure fi v is compactly supported. 

Let the support of fjX be the following disjoint union of intervals: 

I 

Supp(^) = {J[aj,bj]. 

j=l 

Now if 

• V{x) is real analytic 

• V(x) grows fast enough at infinity: lini| a .|_ >00 ^Jfj^ = oo, 

then the equilibrium measure is ^(x)dx, where 

*B(x) = R + (x)^h(x) 
i 

R ( x ) = aj)(z-bj), 
i=i 

and h(x) is real analytic on 1Z. Sec [DKM98J. 

Since is continuous one can use the calculus of variations to derive the 
Euler-Lagrange equations for the equilibrium measure: 

Theorem 11.2. There is a constant I G 1Z such that the equilibrium measure 
fi satisfies the following conditions. 

• 2 J* log \x - y\- l dfi v (y) + V{x) > I for all x £ K. 

• 2 f log \x — y\~ 1 d[i v (y) + V(x) = I for x in the support of 

Conversely, if a compactly supported measure fj, satisfies the above conditions 
for some I, then it is the equilibrium measure [i v . 

If ib{x) > except at the endpoints of J, and we have strict inequality in 
the above theorem for x J, then we say that the potential V(x) is regular. 
Otherwise we call the potential singular. 

For simplicity, we consider only the case when V(x) is regular and the sup- 
port of tp is a single interval J = [a,b]. This is always the case when V(x) 
is convex. 

11.2. Uniqueness for 2x2 Riemann Hilbert Problem. Let £ a 

contour in C, and So be the same contour excluding the points of intersec- 
tion. Suppose that u(z) : T,q — > GL(2,C), such that v is smooth, bounded, 
and approaches Id rapidly on the unbounded components of Sq • 
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Lemma 11.3. Suppose that m : C \ S — > GL(2,C) satisfies the Riemann- 
Hilbert problem (T,,v) if 

(1) m is analytic in C\T, 

(2) m + {z) = m-(z)u(z) for z G T,q. 

(3) m(z) — » Id as z — ^ oo. 

Assume further that det(V) = 1. Then the solution of the Riemann-Hilbert 
problem is unique, if it exists. 



Proof. We sketch the proof from p. 194-198 of [Dei99]. First, because 
det(V) = 1, det(m) is analytic across the contour. So det(m) is an analytic 
function and, because of the behavior of m at infinity, det(m) must be the 
constant 1. In particular m is always invertible. 

Now suppose that m and rh are two solutions. Then, using an algebraic trick 
specific to 2 x 2 matrices with determinant 1, it turns out that H = rhm^ 1 
satisfies a Riemann-Hilbert problem with the same contour but v = Id. In 
other words H is analytic, approaches Id at infinity, hence is identically 
equal to Id, establishing uniqueness. □ 



11.3. Expressing Orthogonal Polynomials as the Solution of 
Riemann Hilbert Problems. Let TTj(z) be the monic orthogonal polyno- 
mials with respect to the weight e~ nV ^ z \ 

Theorem 11.4 (Fokas, Its, Kitaev). Let Y(z) be the 2x2 matrix-valued 
function satisfying the following RHP: 

(1) Y(z) is analytic in C\TZ. 

(2) Y(z) = (1 + OCe- 1 )) ( Z q z - q ) as z -)■ oo, z € C \ K. 

( i p -nV(z) \ 

(3) Y + (z) = Y^z)^l e 1 jforzeTZ. 

Then Y encodes information about ir q (z), the (q)th monic orthogonal poly- 
nomial with respect to the measure e~ 

-nV(x) dx . 



Y(z) 



n q\ z ) J-R s-z 2-Ki 



\ lq-YKq-l\Z) lq-1 ) n s _ z 2 ni 

where the constant is 

-2-Ki 



lq-1 



Jtt; 2 Js)e- nV ( s )ds' 



3.3 



The 2x2 jump matrix in this Theorem has determinant 1, so the solution of 
the Riemann-Hilbert problem is unique if it exists. To prove this Theorem, 
one verifies that the given solution satisfies the Riemann-Hilbert problem. 



11.4. Removing the Bulk of the Oscillatory Behavior. For $s(z) / 



We choose log(z) to have a branch cut along the negative real axis, and to 
be real on the positive real axis. 

Observe that g{z) is analytic in the region C \ (— oo, bj\. For real z, let g+{z) 
be the limiting value from above the branch cut, and g~(z) the limiting 
value from below. Then 



Observe that because ^(s)ds is a probability measure, g+(z) — g-(z) = 2iri 
for z < a. Thus e ng ^ is analytic in C \ [a, b\. 

For z G C \ K, let 



0, let 






Then satisfies a simpler Riemann-Hilbert problem: 



(1) mS 1 \z) is analytic in C \ TZ 

(2) m [ l\z) = m a) (z)uW(z) for z G TZ 

(3) (z) = I + 0(z- r ) as z -»• oo, 



where 



v 





e n(g-{z)-g+(z)) 




e n(g-(z)+g+(z)-V{z)+l) 
e n{g+(z)-g-(z)) 




3d 



The upper right entry in the jump matrix v is e n( - 9++9 ~ ^+0. "We analyze 
the exponent in detail. 



g + (x) + g_{x)-V(x) + l 

/oo 
logflx- s\)V(s)ds- V(x) + 1 
-oo 



Recall that the equilibrium measure \& satisfies the Euler-Lagrange equa- 
tions. Thus 

C < for x < a 
g+{x) + g_(x) - V(x) + 1 < = for a<x<b 

{ < for x > b 

Because of our assumption that the potential V is regular, the inequalities 
above are sharp. 

A further simplification to the jump matrix is this. For x < a and x > b, 

e n(g+(x)-g-(z)) _ e n(g+(x)-g-(z)) _ j 



To summarize, the jump matrix defined on 1Z has the following properties 
in the three regions: 



1 e n(g-(x)+g+(x)-V(x)+l) 

1 

e -n(g+(x)-g-(x)) ^ 

e n(g+(x)-g-(x)) 

1 e n(g-(x)+g+(x)-V(x)+l) 
1 



for x < a 
for a < x < b 
for x > b 



11.5. Analytic Continuation. The difference g+{x) — g_ (x) has been 
defined on the real axis as the difference between g(z) on opposite sides of 
a branch cut. Recall from the previous Subsection that g+(x) — g~(x) has 
an integral representation for x 6 1Z. Call this difference G(x). 

G{x) = 2-ni j ^{s)ds. 

Recall that because V(x) is real analytic, ^(z) is an analytic function times 
y/(z- a)(b - z): 

^{z) = h{z)^J{z-a){b- z). 

The function h(z) is analytic in a neighborhood of 1Z, and real on the real 
axis. We choose ^(z) to be positive on the interval [a, b] and have branch 
cuts along (— oo,a] and [b, oo). Because of its integral representation, G(z) 
is also analytic in this region. 
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Figure 1. It is easy to translate between a solution of one 
Riemann-Hilbert problem and a solution of the other. 



11.6. Factoring the Jump Matrix. Along (a, b), G'[x) = — 2m^{x). 
Thus there is a complex neighborhood U of (a, b) such that Sft(Gr) > in 
U DC + , and < in UC\C~ . Because G behaves as z 3 / 2 at the endpoints 

a and 6, C/ cannot form an angle greater than ^, with respect to the real 
axis, at either endpoint of [a, b]. See Figure [TJ 

In the interval [a, 6], we factor the jump matrix as 

v = ( e nG(.) 1 ) ( -1 ) ( e -«GW 1 ) 

(1) (1) (1) 
= v_ v\ 

As illustrated in Figure HJ we deform the original contour E^ 1 ) to obtain a 
Riemann Hilbert problem on a new contour S^ 2 \ It is possible to translate 
between a solution of one Riemann-Hilbert problem and a solution of the 
other. Except for z in the region enclosed by the lenses, m^ 2 \z) = m^>(z). 
Inside the upper region, 

m {2 \z) = m {l \z) (u^\z)Y 1 . 
For z inside the lower region, 

m^(z) =m^(z)v W {z). 

The jump functions on the new contour are: 
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1 e n(g-(z)+g+(z)-V(z)+l) 

1 
1 

e nG(z) ± 

1 
-1 

1 

e -nG(z) i 

1 e ™(<?- (*)+!?+ (*)-^(*)+0 
1 



for 


z < a 




for 


z on lower 


lens 


for 


a < z < b 




for 


z on upper 


lens 


for 


z > b 





All of the exponents appearing in these jump functions are negative. As 
n — > oo, these terms disappear. The limiting jump functions are: 



v {oo) {z) = I 




for 


z < a 






for 


z on the 


lower 


lip 


for 


a < z < b 






for 


z on the 


upper 


lip 


for 


z > b 







Let m/ 00 ) be the solution of this limiting Riemann-Hilbert problem. The 
convergence — > as n — > oo is not uniform and occurs more slowly 
in neighborhoods of a and b. Thus we cannot automatically conclude that 

VTL2 —5- moo as 71 — \ OO. 

Because we assumed the potential V{x) is regular, there are no other areas 
of slow convergence. 



11.7. Solution of the Limiting Riemann Hilbert Problem. To 

find the solution moo, we perform a change of basis to diagonalize the jump 
function. This replaces the 2x2 Riemann-Hilbert problem for 777,00 with a 
pair of scalar Riemann-Hilbert problems. 

For a < x < b, the diagonalization of the jump function is: 
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The transformed Riemann-Hilbert problem is: 

-l 

/ l i 

m(z) 



Hz) 



1 




— i 


)" 


1 


^ 







i 








—i 


1 


^ 








1 1 

i —i 



for x < a 
for a < x < b 
for x > b 



The solution of this pair of Riemann-Hilbert problems is 

m(z) = 



p 








r 1 






z — 


by 


z — 


a) 



p = 

where /3 is analytic on C \ [a, b] and j3(z) — > 1 as n — > oo. 
Changing back to the original basis we recover the solution m 0O (z): 

m=o(z) = ( J 2j )*(»)( ! i ) 

0+0- 1 fi-f}- 1 \ 

2i 2 / 



11.8. Parametrices at the Endpoints. The convergence z^ 2 ) — >■ ^(°°) 
is not uniform at the endpoints a and 6. Thus the approximation (z) ~ 
m (oo) ^ j g i na pp r0 p r i a te near a and 6. 

Let O a and be neighborhoods of a and 6, which can be chosen as small 
as desired. Inside the neighborhood O a , one constructs a parametrix m a 
which satisfies the following Riemann-Hilbert problem: 



• is analytic on O a \ T>^ 

• m ( +\z) = m^\z)u^{z) for z G S( 2 ) n O a 

• m^{z) = m(°°)(z) (1 + O (I)) for z G dO a . 



In other words, m a satisfies the Riemann-Hilbert problem (E^ 2 ) , v^) exactly 
in a neighborhood of a, and matches m^°°> to within 0(n _1 ). Similarly, one 
constructs a parametrix in CV We will not discuss the construction of 
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exponentially small 



0(l/n) 



exponentially small 



0(l/n) 




exponentially small 
Figure 2. The Riemann-Hilbert problem for L. 



exponentially small 



thes e parametrice s, which comprise a large part of the work of DKM + 99a 
and [DKM+99b] . 

We now patch together an approximate solution of the Riemann-Hilbert 
problem for m^: 

m (oo) for z( zC\O a \O b 

(p) = } m (a) for z g Q a 
m W f or 2gOl 



The actual solution will be recovered as a perturbation of the approxi- 
mate solution. Let L(z) = m^ 2 \z)(m^ pS> (z))^ 1 , so that rrS 2 " 1 can be recovered 
from and L. Then L approaches Id at oo and has a jump function as 
indicated in Figure [2j 



Observe that the jump function for L(z) in in both L 2 (Yi) and L°°(E). 



11.9. The Solution L(z) and its Properties. Since m^ 2 \z) = L(z)m^ p \z), 

the quantity L(z) — Id plays the role of an error term in the approximation 
irS 2 \z) = m^ p \z). In order to bound these errors, we make use of Theo- 
rem [TT3J which appears in [Dei99j . near p. 219. 

Suppose that for a given Riemann-Hilbert problem (S, v), the jump function 
v factors as v = bZ 1 b+, where the b± are bounded and invertible. Let 

b± = Id db oj± and to = u + + to- . 

For / € L 2 (S), let C± be the limits of the Cauchy operators: 

(Cf)iz) = — [ -^-ds for zeCC\Z 
2m Jy, s — z 

(C±f)(z) = lim Cf(z') for z G S 

z'— >z 

Let C w f = C+(fu)-) + C-{fu + ). Assume that oj± are in L 2 (E), so that 
C^IdGL 2 ^). 
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Theorem 11.5. Suppose that I — C u is invertible on L 2 (X) and let \i G 
/ + L 2 (S) he the unique solution of 

(I - C w )» = Id 

or, more properly, 

(/ - C u )(n - Id) = C^Id G L 2 (£). 

Then 

m(z) = I + (C(fxui))(z) for zGC\£ 
is the solution of the Riemann Hilbert problem. 

We now depart from [De i99j in order to bound L(z) and its derivatives. 

In our case, we may choose 6_ = Id, 6+ = v, and u>+ = v — Id. Then 
C w / = CL(/w + ). 

For smooth contours, the C± are bounded on L 2 (T>) with constants depend- 
ing only on S. Since o; + G L°°(S), 

||a/|| L2(S) < c\\u; + f\\ LH ^ 

< c||w + || L oo( S )||/|| i 2( S ). 

Since ||w + ||^cx)( S ) = 0(^)j we eventually have ||C W || Z/2 ~ >Z ' 2 < \. Once this 
occurs, (I — C w ) is invertible with L 2 — > I? norm at most 2. This allows us 
to bound - Id) in L 2 (S): 

11/^-^1^2(2) < Cw)Cw-Jl|.L2(£) 

< 2||C aJ J|| i 2( E ) 

< 2c||w + || L 2( E) 

Finally we have an expression for L(z): 

L{z) = Id + J_ f J^L ds . 

Z7TZ Js S — Z 

Since /i G L 2 (£) and u G L 2 (S), (//a;) G L*(E) and 

i i 

H^Hli(e) < IMIl2 (s) IM|£ 2(s) 

o 

n 













K n) 






o( 






\ n) ' 
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For z not within distance d of the contour S, 

1 f flOJ 



\L(z) - Id\ 



ds 



< 



2ni s — z 

IIHIli(e) 



2vrd 



1 



nd , 



For \z — S| > (i, we can also control the derivatives of L(z): 



Qk 

dz k 



L(z) 



k\ 



fJLUJ 



k+l 



2iri Jy, (s — z 

= o- 1 



ds 



nd k+1 J 



11.10. Retracing Our Steps. We now retrace our steps in order to 
obtain asymptotics for orthogonal polynomials, on the real axis for a < z < 
b. 

First, we choose a proper subinterval of [a + e, b — e] C [a, b\. Next, review 
the above steps to make sure that the neighborhoods O a and Ob have radius 
strictly less than e. Let d be the distance between the contour £ and the 
interval [a + e, b — e] . 

For z G [a + e, b — e], we obtain the asymptotics for Y(z) by approach- 
ing the real axis from above. Recall that Yn(z) = ir n (z) and Y2\(z) = 
— i(const)-7r n „i(z) are the quantities that interest us, and are unaffected by 
the jump function along the real axis. 

Y{z) = ( e " Q T I ] L{z)m^\z) ( e J G(z) J ) 

^ p ng+(z) 

e~^~L n (z) e~^L 12 (z 
e~L 2 i(z) e~L 22 (z) 
( e f+ng+{z) o 

X ( e f +ng + (z)~nG(z) e -f+ng+(z) 





The entry Yu(z) in the above product is 
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27 



2/ 



3 n S+ (.) e ^( Lll ( z ) + iLl2 (z)) 



b - z\ 4 



z — a 



+e-^(Ln(z) -iL 12 (z)) 



z — a \ 4 



6-z 



9f 



s n ^J?*W d8 ie^(L u (z)+iL 12 (z)) 



b- z\ 4 



z — a 



+e-^(L u (z)-iL 12 (z)) 



z — a \ 4 



6-z 



The entry Y 2 i{z) in the above product is: 



s n«+n/ a f> log|2-s|<I'(s)ds x 



e ^(L 22 (z) - »L 2 i(*)) 



6 - z\ 4 



2 — a 



+e-^(L 22 (z) + iL 21 (z)) 



z — a \ 4 



6-2 



In the Christoffel-Darboux formula, it is easier to deal with normalized or- 
thogonal polynomials (ftj(z) than with monic orthogonal polynomials 7r,-(z): 



<f>j(x) 



7Tj(x) 



!-oo^A x )) 2e ~ nV{x)dx 



When j = n, we approximate heuris tically the n ormalizing constant in the 
denominator, refer the reader to to [DK M + 97 for the precise result, and 
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state without proof the error term in our approximation: 

poo 

(ir n (x)) 2 e- nV( - x) dx 
1 + ° Q)) j\^)) 2 e~ nV(x) dx 

= ( l + O ^~yj X J* (?n S b a log\x-s\*(s)ds-nV{x) 



oscillatory 



6 — x \ 2 / x — a A 2 
+ 



x — a 



b — x 



dx 



l + O ( - | | ( 

n 



,6 x 
J a 2 



l . l- 

6 - x\ 2 f x - a\ 2 . . 

+ I I I dx 



x — a 



b — x 



-nl ' 



Even more convenient than the normalized orthogonal polynomials, for later 
computations, are 

_ 4>j(x) 

T)j(X) - e n v{x y 

The advantage of using r]j(x) is that they are orthonormal with respect to 
Lebesgue measure on TZ. 

The entry Y2\(z) is a negative imaginary constant times -K n -\(z). Using tech- 
niques similar to those above, we approximately normalize this polynomial, 
obtaining an approximation for <^> n _i(x). 

Using our knowledge of Yn(z), Y2\{z), the normalizing constants, and L(z), 
we obtain the leading order asymptotics for r] n (x), %_i(x), and their deriva- 
tives. 

Theorem 11.6. Let e > 0, and k a nonnegative integer. For z G [o+e, b— e], 
the leading order asymptotics of 



dz k 



r)n(z) 



gk 

and —^r] n ^i(z) 



as n — ^ oo are, respectively, 
(nix^{z)) k 1 



(b — a)ir 



l k e ninf z ^(s)ds [ p i 



e 4 



13 



and 



■ft 



(6 — a)ir 



e 2 4 



1 1 ' 

6 — z \ 4 ^ f z — o\ 4 

+ e ! < 



z — a 



b-z 



+ o 



TTie constants implied by O depend only on k, the potential V(x), and e. 



12. Special Case: The Gaussian Unitary Ensemble 



The ensemble GUE n is the set of n x n Hermitian matrices with joint prob- 
ability density function (j.p.d.f.) 

exp - m h ~ 2 I' dM 

= C n e~ Tr ^dM 

In other words, the entries on the diagonal are real, and are chosen indepen- 
dently from the Gaussian distribution with variance 1. In the upper triangle, 
the real and imaginary parts of each entry are chosen independently, all from 
the Gaussian distribution with variance ^. 

The j.p.d.f. for the eigenvalues is obtained from the j.p.d.f. for the matrix 
entries using Weyl integration: 

(nG9*^W-^.- E * 

y=o KJ 7 / i<j 

At first glance the Gaussian unitary ensemble seems not to be an instance 
of the universal unitary ensemble, because the j.p.d.f. contains the term 
e~^ x i instead of e~ n ^ Xi . Since the potential V(x) = x 2 is homogeneous, 
this difference is eliminated by rescaling. Thus, the Moment Estimation 
Theorem we present for GUE is really a special case of the universal version 
of the Theorem. 

Instead of just using the universal Theorem, we prove the Moment Esti- 
mation Theorem in the GUE case by using Plancherel-Rotach asymptotics 
for Hermite polynomials instead of the more general asymptotics by Deift, 
Kriecherbauer, McLaughlin, Venakides, and Zhou. The standard reference 
for Plancherel-Rotach asymptotics is [Sze75j. 

We present this alternative proof because many readers will be familiar 
with Plancherel-Rotach asymptotics but not the Deift-Zhou asymptotics. 



7T 



„ 2 n(n + l) 

— 2 — 2 — 
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For such readers, the GUE version of the Moment Estimation Theorem is a 
good warm-up for the universal case. 

Theorem 12.1 (Moment Estimation Theorem, GUE version). Let e > 0. 
For each n, let I n C (— (\/2~ — e)Vn, (\/2 — t)^/n) ^ e an interval, such that 
^Jn\I n \ — )■ oo as n — > oo. Let G 7i / iW 6e i/ie random variable which counts the 
number of GU E n eigenvalues whose average is in I n and whose difference 
is at most j n . Let be the Poisson distribution with mean 

7T 2 7 3 f 

li n = — — J K n (x,x) A dx. 

Then for all k > 1, 

E(G k jJ:n ) = £7(Gj) (l + O (n- 1 ,n 7 2 , (n3|J|)-§)) . 
The constant implied by O depends only on k and e. 



The novel aspects of the proof of the GUE Theorem are the estimation of 
the first moment using Plancherel-Rotach asymptotics, the different scaling, 
and a slightly different proof of the bounds for K n (X, y) and its derivatives. 
We only treat these aspects of the GUE proof. 



12.1. The First Moment in the GUE Case. As in the CUE and 

UE cases, one uses the method of Gaudin to express the expected value of 
G~/j, n as a two-dimensional integral. 



£(G 7 ,/,n) = / I 92(u,t) 



K n (u,u) K n (u,t) 
K n (t,u) K n (t,t) 



dudt 



K n (u,u) K n (u,t) 
K n (t,u) K n (t,t) 



dudt, 



where O is the region 

n ■■ 



(u,t) s.t. 



U+t a T 

. 2 f= 1 

\t — u\ < 7 



We state without proof an expression for the projection kernel: 

' H n (x)H n _i(y) - H n _i(x)H n {y) 



K n {x,y) = — ———== I 

2 n (n — lj'v 71 " V x — y 

where H n (x) is the (n)th Hermite polynomial. This expression is an instance 
of the Christoffel-Darboux formula, which we encountered in the UUE case. 
The only challenge in the above formula is to derive the correct normalizing 
constant. 

As compared to the U n case, it is now slightly more difficult to estimate 
E(G 7 j >n ) because the kernel K n {x,y) no longer depends solely on the dif- 
ference (y — x). 
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As in the case of UUE (Subsection | 
integrand in a Taylor series: 



we change variables and expand the 



The region Q is narrow in the (t — u) direction. This suggests expanding the 
integrand in a Taylor series. Let x = and y = Then dxdy = ^dudt. 
In terms of the new variables x and y, the first moment is: 



2 (kkS/~ - KKy** - K^ 2 ) + O(y 4 c c 4 + y 8 c 4 c 4 )) dxdy 



where and Cj is the maximum over of any (j')th partial derivative of 
K n (x,y). Using the Lemma 112.21 instead of Lemma 110.11 the integral of 
0(y 4 coC4) and 0(y 5 C4C4) over Q, are O (|/|7 5 n 3 ) and O (|/|7 9 n 5 ) respec- 
tively. 

We explicitly expand the integrand in terms of Hermite polynomials. After 
collecting similar terms, 



O 



|/| 7 V + |i| 7 V)+ /_2_ 



-2t 2 



2 2n (n-l)\ 2 n 



-{H n H'l_ x - H n -iH'n) 2 



dt. 



All Hermite polynomials or their derivatives are evaluated at t; this is omit- 
ted to save space. In the next section, we will use the Plancherel Rotach 
asymptotics to derive an approximation to the integrand valid as n — > oo. 



12.2. Application of Plancherel Rotach Asymptotics to the First 
Moment. Let's use Plancherel Rotach to estimate E(G^j tn ). Unlike other 
systems of orthogonal polynomials (such as we will encounter in the case of 
universal unitary ensembles), the Hermite polynomials can be differentiated 
easily. For fixed k as n — > oo: 



n—k \ 



2(n - k)H n _ k ^(x) = 2nH n _ k _ l (x)(l + 0{rT 1 )). 



This shortcut for differentiating Hermite polynomials allows us to simplify 
the integrand from the most recent formula for E(G~i n ): 



E(G 



o 



|/| 7 5 n 3 + |j| 7 9 n 5) + f ^ 

J I 3 



-2t l 



2 2n (n-l)\ 2 TT 



n 



(i + o^- 1 )) 



+ s(H n -iH n -s — H n H n -4)(H n _ 1 
—(H n H n s — H, 



iH n -2) 2 



dt. 



Notice that we factored the (1 + 0(n 1 )) outside of the brackets in order to 
save space. Technically this is dangerous because of the possibility that the 
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terms inside brackets interfere destructively, but we will see that destructive 
interference does not occur in our case. 

We use the Plancherel Rotach asymptotics in Theorem 112.31 to write the 
Hermite polynomials as an envelope times a phase function and estimate 
each of the above groupings. For the first grouping, the result is: 



(7/„-. L //„- l -//„//„- 2 )-r'-2"(/ J - I (l-£ 

1 2n 



2 



X 



[»(*(, n - l)) 2 - «))»($(*, n - 2)) + O^ 1 )] . 



We will now use Lemma 112.41 Let 

u = Q(x,n) 



x — i 



y/2n — x 2 



>2n 



The in terms of these variables, 



$(x,n) = u 
<£(x, n — 1) = uv 
<£(x, n — 2) = uv 2 

Using the identity 5ft [uv] 2 — 3f?[ii]5i[iii> 2 ] = Q[v] 2 for unimodular u and v, we 
have 

(H 2 _iH n H n _ 2 ) = e t2 2 n (n - l^V^ (l - |- V [l + C^n" 1 )] . 

Applying the same techniques to other groupings, 

{H 2 n _ x - H n H n „ 2 ) = e'V(n - l)"-^ 1 "^- 1 {s 2 ) [l + O^ 1 )] 
(^ 2 -^-iM = e t2 2 n -\n-2T- 2 e 2 - n s- 1 {s 2 )[l + 0(n- 1 )] 
(#„-i#n-3 - H n H n ^) = e t2 2 n -\n - 2) n - 2 e 2 ~ n s' 1 (3s 2 - 4s 4 ) [l + 0(r 

(tf n tf n -3 - Hn-xHn-2) = e'V-§n§(n - h^h^s' 1 (-2cs 2 ) [1 + 0( 



where 



n 



t 2 

1 

2n 



t/V2n. 
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Incorporating these newly derived asymptotics into the recent expression for 
E{G 1> i >n ) yields: 



E(G y ,j, n ) = 0(/ 7 V + |/| 7 V)+^^ S - 2 (l + 0(n-^ 
= O (l 7 V + |I| 7 V) + J l2^! s -2 (x + o^" 1 )) 



§(s 2 )(3s 2 -4 S 4 ) 
(-2cs 2 ) 2 + 0(n- 1 ) 



2(0(**) 



dt 



dt 



Recall the Wigner Semicircle law from Appendix 112.51 which says that 

/2n 



K n (t,t) 
We therefore see that: 



t 2 



E(G 7 ,/,„) = O (; 7 5 n 3 + |/|7 9 n 5 ) + ^j- j K n (t,tf(l + O^r 1 ))*. 
This agrees with our heuristic prediction. 



12.3. Controlling the Derivatives of K n (x,y) in the GUE Case. 

The error terms in our estimates of the moments of E(G^ 1 n ) are bounded 
in terms of derivatives of K n {x, y). We now bound these derivatives. Unlike 
the projection kernels of universal ensembles to come, we can now exploit a 
shortcut when differentiating the orthogonal polynomials. In this case the 
orthogonal polynomials are Hermite polynomials, and the shortcut is 



d_ 

dx 



H n (x) = 2nff n _i(x). 



The following Theorem is the GUE version of Lemma 13.11 
Lemma 12.2. Fix e > 0. For (x,y) in the region 



R 



1 - e)\/2n, (1 - e)V2n x [-(1 - e)V2n, (1 - e)V2n 



a mixed partial derivative of K n (x,y) of total degree k is O 2 j. When 

\x — y\ > n~2 ; we have the stronger bound K n (x,y) = O ( j^z^ n%^J . The 
constants implied by O depend only on k and e. 

Proof. Recall the closed form expression for K n (x,y): 

e 2 H n (x)H n _i(y) - H n ^i(x)H n (y) 



K n (x,y) 



2 n (n - 1)!Vtt 



x-y 
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Take k partial derivatives of K n (x, y). The result is a sum of several terms 
of the form: 

(x 2 +y 2 ) 

e 2 p ko {n)q kl (x,y)H n _ k2 (x)H n _ k3 (y) 



2 n (n-l)! v / ? (x-y) k * 

where p ko (n) and q kl (x, y) are integral polynomials of total degrees ko and k\ 
respectively. Before taking any derivatives, i.e. k = 0, there are two terms, 
each with (ko, k±, + &3, /C4) = (0, 0, 1, 1). Each new partial derivative can 
hit in one of four places, and increments the worst-case vector (ki, /C2+/C3, £4). 
Here are the four cases: 



• It hits e 2 . The increment is (0, 1,0,0). 

• It hits q. The increment is (0,-1,0,0). 

• It hits a Hermite polynomial. The increment is (1,0,1,0). 

• It hits the denominator. The increment is (0,0,0, 1). 

We divide the region R into two regions 

Rdiag = Ux,y)ER\x-y\<^=\ 

1 

\x-y\>—= 

Our estimates are straightforward in the bulk region Rbulk- Consider any 
of the finitely many terms resulting from taking k partial derivatives of 

K n {x,y): 



Rbulk = \(x,y) £ R 



Pk (n)q kl (x, y)H n ^ k , 2 (x)H n _ k3 (y) 




[x - y) 



/t'4 



, ki n_ n-fc 2 _ n n n — fcg _n y 

n °K 2 2 2 n 2 e 2e22an 2 e 2e2 



\x — y\n 2 



1 2k +k 1 -k 2 —k 3 + k i 

tU 2 



\x - y\ 



„ / 1 i+fe 

O -. tU 2 

\\x-y\ 
O(n^) 



Estimates for the diagonal region Rdiag &re more subtle. 
Consider the term 

_ (x 2 + y 2 ) 

e 2 p ko (n)q kl (x,y)H n _ k2 (x)H n _ ks (y) 
2 n (n - l)!^/vr (x - y) k ± 
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We expand H n - ks (y) and q kl (x,y) as a Taylor series centered at x: 
qkA x >y) = Qk 1 (x,x) + (y-x)(d 2 q kl (x,x)) + ... 



+ 



^— - (d k 2 *-\kAx,x)) + (^(X.X!)) 



(fc 4 - 1)! V 1 ™ iy ' ') k 4 \ 
H n -k 3 (y) = H n _ k3 (x) + (y - x)H' n _ ks (x) + . . . 

(y-x)^- 1 TTk4 . lf . , {y-x) k ± 



for some X\ G (x, y) and X3 G (x,y). Here's what happens if we select the 
remainder term from the expansion of H n _ k3 (y), and a lower order term, 
< j\ < k 4 , in the expansion of q kl (x,y): 

PkM (irfi (dtq kl (x,x))) H n _ k2 (x) (^H k n t ks (x 3 )) 



e 2 



2 n (n - 1)!aA (x-y) fe4 

/ l+2fc + fc 1 -j 1 -fc 2 -fc 3 + fc 4 \ / l+fc-3l \ 

= O In 2 =0 n 2 I 

Here's what happens when we take a lower order term, < j'3 < k 4 , from 
the expansion of H n _ k3 (y), and the remainder term for the expansion of 
Qk!(x,y): 

PM (dtq kl (x,x))) H n _ k2 {x) (^Ht k3 (x 3 )) 



e 2 



2 n (n-l)! v / 7r (x-y) fc 4 

/ l+2fc + fc 1 -fc 2 -fc 3 \ / l+fc-fc 4 \ 

= O ( n 2 =0 n 2 

A term with lower order terms j\ and j'3 with j\ + j'3 > &4 also make a small 
contribution, 

/ l+2fc +fc 1 -fc 2 -fc 3 +fc 4 -j 1 \ / 1+fc-J! \ 

O ( n 2 =0 n 2 . 

Most subtle of all are the remaining terms 

e -^l Pk0 (n) (iffii (d?q kl (x,x))) H n ^(x) (jgggt^x)) 
2 n (n- 1)!aA " (x-y) fc 4 

where ji + j'3 < /C4. Adding all of these terms together, we obtain, for each 
1 < I < k 4 , a polynomial in x times 



e 2 
(x - y) r 

Since K n (x,y) is smooth, there are no singularities on the diagonal, and 
all these polynomials in x must vanish identically. Thus all the terms with 
ji + J3 < &4 cancel out against each other. □ 
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12.4. Derivation of P lanche r el- Rot ach Asymptotics. We will use 
the method of steepest descent (see [BQ78] ) to prove the Plancherel Rotach 
asymptotics for Hermite polynomials. 

Theorem 12.3 (Plancherel- Rotach) . Fix e > 0, and let (x,n) be a pair for 
which \x\ < (y/2 — e)-*/n. Then: 

H n (x) = e ~2~n2e~2 f 1 - — j [Re($(x, n)) + 0(n L )\ , 
where the phase <3?(x,n) is: 



<f>(x,n) = e i l [ _^ = + iWi_|_ | ( ~ 
yV2n V 2n 

The constant implied by O depends only on e. 



Proof. Begin with the well-known integral representation for Hermite 
polynomials: 

2 n f'°° 2 

H n (z) = -= / (z + it) n e- 1 dt. 



One may verify this integral representation by checking that Hq(x) and 
H\{x) are correct, and then verifying the three term recurrence H n (t) = 
2tH n -\(t) — 2(n — l)H n -2(t) for Hermite polynomials. 

First change variables so that the locations of the saddle points will remain 
constant as n — > oo. Let w = and u = -7=. The integral representation 
becomes: 

n+l „ 
on„-i- /"oo 



For fixed w S (— (v2 — e), \[2 — e) let us apply stationary phase to the above 
integral in the n — > oo limit. The saddle points occur when — 2u = 0, 

° c W + IU ' 

or 

- (iw ± \pl — w 2 



U± 2 

The second derivatives of the phase function at these points are 



f± = 2\w 2 -2=f iwy/2 - w 2 
The absolute value of f± is: 



2^4-2^2. 

The slope of the stationary phase contour at the saddle point u + is 



V i/+r 

with the sign chosen to have positive real part. 
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saddle points 



/\imag 




> > 




Figure 3. The contour for the integral representation of 
Hermite polynomials deformed so that it passes through the 
saddle points. 



The value of the integrand is v± , where 



The integral representation for H n {^Jnw) has the real axis as its contour of 
integration. We change this contour to a stationary phase contour, that is 
Q(log(w;+wt) — u 2 ) = const, which passes through the two saddle points. The 
stationary phase contour begins at — oo, hits the left saddle point, crosses 
over to the right saddle point, then goes to +00, and overall looks a bit 
like a Gaussian. Reducing to small neighborhoods S- and S + of the saddle 
points produces only exponentially small errors. 

The contributions from the two neighborhoods are complex conjugates of 
each other, so we focus on S + : 
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H n (yjnw) = 



2K 



n + l 



2 n n— 



7T 



2 n n~ 



7+ 27T 



(i + o^- 1 )) 



,,21+1 » I n f«; 2 -11 / i ^ 

2 2 n 2 e 2 n( . w 'Ml 

2 



x 3? 



u> iy/2 — w 2 



+ 



y/2 y/2 



— ^iwny/2—w 2 ^ 



y/2 — w 2 iw 
y/2 + V5 



Now recall that w = -?=, and notice that two of the terms in the phase 
function can be combined: 



•ill n » : / z 2 \ 4 
if n (z) = 2 2 fiie 2e2 (l-_ 



x 5ft 



e 4 



+ n/l - — ) e -|«V2n-^ + ( n - 1 ) 
2ra V 2n 



□ 



We'll often use Plancherel-Rotach by comparing H n (x) for consecutive val- 
ues of n. The following simple Lemma describes the way in which varies 
with n: 



Lemma 12.4. Fix e > 0. Then for pairs (x, n) for which \x\ < (y/2 — e)y/n, 



<&(x,n+l) _ / x + iy/2n - x 2 
<f>(x,n) I y/2n 
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Proof. 



2 



<j?(x, n) 



t ! z 2 z z 2 



1 + i—=\ l i , \ 1 + 0(n~ 2 ) 

V^V 2n + 2 v^n + 2V 2n V 7 



L z 2 



V2n + 2 V 2n + 2 



+ e 2V2n- z 2 + o(n _1 ) 



: . / Z 2 



- + H 1-^ + 0(n~ 1 ) 
2n V 2n / 



□ 



12.5. Derivation of Wigner Semicircle Law using Plancherel 
Rotach Asymptotics. We use Plancherel Rotach asymptotics, Theorem ll2.3l 
to prove the Wigner semicircle law. This is similar to our estimation of 
j y n) using Plancherel-Rotach asymptotics. 

Lemma 12.5. For fixed e > and x G [— (1 — e)y/2n, (1 — e) v2re] , 
diagonal of the kernel K n (x,y) satisfies the following asymptotics osn-> oo: 



K n (x,x) = ^Jl-^(l + 0(n- 1 )). 
7r V 2n 



Proof. Since K n (x,y) is smooth everywhere, it is continuous on the 
diagonal x = y. By continuity, 

K n (x, x) = lim K n (x + h, x) 

/i-»0 

e"^ ( H n {x + h)H n ^{x) - H n ^(x + h)H n (x) 

- lim 



h^o 2 n (n - ly.y/n V /l 

(2nF n _x(x) 2 - 2(n - l)tf n _ 2 (x)tf n (x)) 



e 21 



2 n (n - l)!0r 

_ i 

" / K($(x,n-l)) 2 -5R($(x,n))K($(x,n-2)) + 



f l-t~^ 2 fK(1>(x,n-l)) 2 



7T \ 2ny \ 
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We will now use Lemma 112.41 Let 

u = $(x,n) 



(x — iy/2n — x 2 \ 
)' 

The in terms of these variables, 

$(x,n) = u 
n — 1) = uv 
$(x, n — 2) = uv 2 

Using the identity 3ft [uv] 2 — ^ft[u\^R.[uv 2 ] = ^s[v] 2 for unimodular u and u, we 
have 

i 

K n (x,x) = ^^(1-^) ( Vt(uv) 2 - $t(u)$t(uv 2 ) + O ' 







9 

X 


7T 


(- 


2n 




(- 


X 2 


7T 




2n 



n 



i 

\ 2 A ~ / I 

n 



□ 



13. Appendix: Weyl integration 

The following discussion of Weyl integration is from !Meh9l] . p. 62-63. 

Lemma 13.1. Suppose that an ensemble of Hermitian matrices has the 
following joint probability density function for the matrix entries: 

C n e- n ^ v ^dM, 

where V{x) is a potential, the 9{ are eigenvalues, and C n is a normalization 
constant. Then the joint probability density function for the eigenvalues is 

i<j 

where C n is a new normalization constant. 



Let U be a unitary matrix and be a diagonal matrix so that 

h = ueu*. 

Except on a set of measure zero, the eigenvalues will be distinct. Assume 
without loss of generality that the diagonal entries of are in ascending 
order. We also assume without loss of generality that the first nonzero entry 
in each column of U is positive real. With these conventions, the and U 
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are uniquely determined by H , and H is of course uniquely determined by 
U andG. Let p^ be n(n — 1) real variables which specify a unitary matrix. 



If one takes J(0,p) above, and integrates out the variables p, then multi- 
plying by the joint probability density function for matrix entries, one has 
the joint probability density function for the eigenvalues themselves. It will 
turn out that J(9,p) is IIj<j(^j — ^*) 2 times a function of p, so that all that 
remains is a function of 9 times a constant. 

We will multiply the above n 2 x n 2 matrix for J(9,p) by another matrix 
of the same dimensions depending only on U, and the determinant of the 
result will be Y[i<j(@j ~ ®i) 2 times a function depending only on U. 

Now UU* = 1, so for each variable p M , 



J(9,p) = det 



f)(rr(°) h-(°) h-W ^ 

u \ -"11 ' • • • ' rL nn i j "12 ■>•••) 



8 (6>i, 6> 2 , • • ■,9 n ,p 1 ,p 2 ,.. ■ ,P n (n-l)) 




dU* 



dPn 



U 



is conjugate Hermitian. Then 



u*—u = s^e 



es>* 



or 




Similarly, 




Writing these two equations in matrix form, 
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jk 


}k 


ae 1 

jj 




<90 7 


<9p M 


<9Pm 





Q 

+ u, 



a,k 



UjM *t[KjU k ,p + u* k u j , i 



10 

M(S^)(8p-9 a ) 9(S^)(fy-0 a ) 



14. Appendix: The Method of Gaudin 



Suppose that F mtn (ti, t%, . . . , t n ) is a symmetric function of n variables, 
which only depends on the variables m at a time. Applying F m ^ n to the 
eigenvalues of an n x n matrix chosen randomly from some ensemble makes 
F m ^ n a random variable. We express the expected value of F m>n as an m 
dimensional integral. 

14.1. The Universal Unitary Ensemble. Let V{x) be a real-analytic 
potential function growing sufficiently rapidly at infinity. Then for the en- 
semble UUE n , the j.p.d.f. of the eigenvalues is 

0<j<fc<n 

where K\y^ n is a constant depending only on V{x) and n. See Appendix PT3l 
This leads to an expression for the expected value of F m ^ n . 
E{F m 

,n) — Kl,V,n / F mn (ti, . . . jtn) 
0<j<k<n 

Theorem 14.1. Let rn > 1 and F a function of m variables. Define the 
symmetrization of F to n variables as: 

Fm,n(tl j • • • > tn) = ^ ^ -^(^ii ) ^2 ) • • • ' ^im) - 

ii,!2,...,i ra distinct 

Then the expected value of F m>n , applied to the eigenvalues of a random 
unitary matrix from UUE n , can be expressed as an integral in m variables: 

E(F mn ) = / F(t 1 ,t 2 , ■ ■ ■ ,t m ) det [K n (tj,t k )\ dt\ . . . dt m . 

J mxm 

The projection kernel K n (x,y) appearing inside the determinant will be de- 
fined below. 
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14.2. Definition of the Projection Kernel K n (x,y). We form the 
sequence of normalized orthogonal polynomials by performing Gram-Schmidt 
orthonormalization on the sequence 1, x, x 2 , x 3 , . . . , with respect to the inner 
product 



We always choose the sign of the leading coefficient of the polynomials to be 
positive. We call th resulting sequence (j) n j(x) the normalized orthogonal 



For each polynomial we have a choice of sign - we choose so that the leading 
coefficient of each polynomial is positive. Let the resulting sequence of 
polynomials be 4>j. Thus, 



An alternative normalization is to choose the leading coefficient of each 
polynomial to be 1. This would result in monic orthogonal polynomials. 
We will use 4> n j to denote normalized orthogonal polynomials and TT n j to 
denote monic orthogonal polynomials. When the context is clear, we will 
omit the subscript n. 

The most familiar examples of orthogonal polynomials are Hermite poly- 
nomials. Hermite polynomials are orthogonal polynomials with respect to 

2 

e~ x dx. They are normalized so that the leading term is 2 J x J . 

Lemma 14.2 (Christoffel-Darboux + three-term recurrence relation). Us- 
ing the above notation, let 




polynomials with respect to the measure e 



-nV(x) d , 






for k>0. 



Then 



x4>o(x) 
x<t>k{x) 



bo4>i{x) + a 4>o(x) 

b k (f> k+1 (x) + a k (j) k (x) + b k -i(f) k -i{x) for k>\. 



and 



n-l 



(f) n {x)4> n ^i(y) - n (y 1(2;) 



c/) j (x)(f> j (y) = b n -i 



x-y 



j=0 



Proof. To prove the three term recurrence relation, observe that x<p k {x) 
is a polynomial of degree (k + 1), so it is orthogonal to all polynomials of 
degree higher than (k + 1). It is also orthogonal to polynomials of degree j 
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less than (k — 1): 

J (x(p k (x))(j) j (x)d^(x) = J (f> k (x)(x(f)j(x))dfj,(x) = 0. 

The second integral above is zero because (xcf>j) is a polynomial of degree 
at most (k — 1), and (/>k(%) is orthogonal to polynomials of such low degree. 
The Christoffel-Darboux formula itself is proven by induction. The inductive 
step is the following: 

i \ fu ( l ) k+i(x)4 , k(y) - 4>k+i(y)4>k(x) 
{x-y) [ bk 



x-y 

, <t>h{x)<t>k-x{y) - <t>k{y)<t>k-x{x) 
x-y 

(b k ct)k+i{x) + b k -i4>k-i{x)) 4>k{y) 

-4>k(x) (b k 4>k+i(y) + h-i4>k-i{y)) 
(b k (j) k+ i(x) + 6fe-i0fc-i(x) + a k (j) k (x)) 4> k {y) 

~4>k{x) {b k (f) k+ i(y) + b k ^i4> k ^i(y) + a k (f> k (y)) 
x(j) h {x)(j) k (y) - 4> k (x)ycf) k (y) 
(x - y)4>k{x)(j) k {y) 



□ 



For computations elsewhere, we find it more convenient to use r] n j(x), where 

Vn,j{x) 



4>n,j(x) 



The advantage of using rj n j(x) is that they are orthonormal with respect 
to Lebesgue measure on 1Z. Again, we will omit the subscript n when the 
context is clear. Using this notation, let 

n-l 

K n (x,y) = J2vj{x)Vj(y) 

, T]n(x)Vn-l(y) ~ Vn(y)Vn-l(x) 
= O n -\ . 

x-y 



14.3. Proof of Theorem 114.11 We will use the following Lemma to 
express E(F m ^ n ) as the integral of an n x n determinant: 

Lemma 14.3. Let K n {x,y) be the projection kernel above. Then: 
TT (tj - t k fe- n ^ v ^ = K 2 2Xn det[K n (tj,t k )], 

' ' nxn 

0<j<k<n 

where K2,v,n depends only on the potential V{x) and n. 
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PROOF. We recognize the component Ylo<j<k<n(^j — t k ) as a Vander- 
monde determinant: 



H (tj - t k ) = det 

i<j<k<n 



n— 1 



i h...q 



i / f n ~ l 



Performing column operations we replace the monomials tj with orthogonal 
polynomials i^ n k{pj)- To replace the monic orthogonal polynomials 7r n & 
with the normalized orthogonal polynomials n ,fc> we divide by the leading 
coefficients. We next replace <j)j with m in order to absorb a factor of 



e 2 



n fe-^- jEvw 

l<j<k<n 

7T0(*l) VTi(ti) . . . 7T n _l(tl) 



det 



e 2 



TTo(^n) 7Tl(t n ) . . . 7T n _i(tAr) _ 

0()(*l) 01 (*l) • • • 071-1 (*l) 



«2,v> det 



«2,v,ri det 



0o(*n) 0l(*n) • • • 0n-l(*jv) 
7?o(*l) »7l(*l) • • • »7n-l(*l) 



. ?7o(*n) r/l(tn) • • • %-i(*at) 
«2,v,n det[M]. 



Squaring the above formula, 



J] ih-t k fe~^ v ^ 

l<j<k<n 

= 4y,„det(M) 2 
= n\ v>n det(MM T ) 2 
= K l,y,n det [K n (tj,t k )}. 



□ 



We now use the results of Lemma ll4.3l to express E(F myn ) as the integral of 
an n x n determinant: 



E(F„ 



v 2,Kn 



Em,n 
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Unfortunately, this is an n dimensional integral instead of m dimensional. 
We use the next Lemma to "integrate out" the extra dimensions one at a 
time. 

Lemma 14.4. Let K n (x,y) = Yl^o Wji^Vjiv) be the projection of n func- 
tions orthonormal with respect to Lebesgue measure. Then integrating 

det[K n (tj,t k )] 

Ixl 

over the last variable t\ is the same as multiplying by (n — I + 1). That is, 



f 

j — i 



det[K n (tj,tk)]dti = (n-l + l) det [K n (tj,t k )]. 
Ixl (l— l)x(i— 1) 



Proof. Expand by minors along the last column. The last entry in this 
column contributes 



det [K n {tj,t k )]( f t,)d*i 
(Z-l)x(J-l) \J 

(J-l)x(J-l) 



Now for the (m — 1) other minors. Consider the minor by expanding along 
K n (tj,ti), where 1 <j < (Z - 1): 

K n {h,h) K n {h,t 2 ) ... K n {h,ti-i) 



det 



K n (tj-i,ti) K n {tj-\,t 2 ) 
K n (tj + i,ti) K n (tj +1 ,t 2 ) 

K n (ti,h) K n {t u t 2 ) 



K n (tj-i,ti-i) 
K n {tj+i,ti-i) 

K n (ti,ti-i) 



K n (tj,ti)dti 



Each of the (I — 1)! terms in the determinant has exactly one term in row 
I, say K — n(ti,ti), but otherwise has no dependence of t\. To integrate this 
term against K n (tj,ti)dti, observe that K n (x,y) is a projection kernel: 

K n (x, y)K n (y, z)dy = K n (x, z). 
The resulting contribution is: 

#„(tl, tl) K n (h,t2) ... K n (ti,ti-i) 



(-l)^'det 



K n (tj-i,ti) K n (tj-i,t 2 ) ■ 
Kn(tj+i,h) K n (tj +1 ,t 2 ) . 

K n (tj,ti) K n (tj,t 2 ) . 



K n (tj-i,ti-i) 
K n (tj+i,ti-i) 

K n (tj,ti-i) 



det 



K n (ti-i,h) K n (ti-i,t 2 ) ■ ■ ■ K n (ti-i,ti-i) 
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Adding the contribution from the last minor to the other (m — 1) minors 
yields the conclusion of the Lemma. □ 



Using Lemmas 114,31 and 1 14 , 41 allows an induction on the following statement 
for m < I < n: 

E{F mtn ) = Kl > V ' n ; n ~ l \. [ F{t 1 ,t 2 ,...,t m )det[K n {t j ,t k )]dt 1 dt 2 ...dt l 



To prove the base case I = n, observe that F m ^ n is a sum of faj^ji terms. 

Each of these terms contributes as much to E(F min ) as the term F(ti,t2, ■ ■ • , t r 
does. Specializing to the case I = m, we obtain: 

E(F m n ) = l ,V ' n / F(t 1 ,t 2 , ■ ■ ■ ,t m ) det [K n (tj,t k )]dti . . .dt m . 
K 2,v,n J mxm 

This proves Theorem 114. 1\ except for the presence of a multiplicative con- 
stant depending only on V and n. To find this constant, let F(x) = 1 be a 
function of one variable. Then 

E{F m ^ n )= 1 = n 

l<3<n 

and 

J K n (x,x)dx = n, 
establishing that the multiplicative constant is 1. 



14.4. The Circular Unitary Ensemble. Applying the method of 
Gaudin to the circular unitary ensemble (CUE) yields the following Theo- 
rem, which differs from Theorem 1 14 . 1 1 only in the definition of the projection 
kernel K n (x, y). 

Theorem 14.5 (Method of Gaudin, CUE version). Let m > 1 and F a 

function of n variables. Define the symmetrization of F to n variables as: 

F m ,n(t\ ,t 2 ,---, t n ) = ^ 1 E(ti 1 , ti 2 , . . . , ti m ). 

ii,i2,—,im distinct 

Then the expected value of F m ^ n , applied to the eigenvalues of a random 
unitary matrix from U n , can be expressed as an integral in m variables: 

E{F m)n ) = / F(ti,t 2 , . . . ,t m ) det [K n (tj,t k )]dti...dt m , 

where the projection kernel is 

1 e in(x-y) _ i 

K n (x,y) = -, r . 

27r e< x -v) - 1 
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Proof. For the CUE, the j.p.d.f. for the eigenvalues is 



ni iOi id k 
\e 3 - e k 



j<k 



j<k j<k 




1 



— K 3,n 





•n 



= K 4 ,n det[K n (6j,6k)], 



nxn 



where 



K n (e 1 ,e 2 ) 



I e in(0i-$2) _ ]_ 



2vr eWi- *) - 1 ' 



Proceeding as in the proof of Theorem 114.11 one must verify that K n is a 
projection kernel: 



It is this requirement which dictates the choice of normalization constant for 
K n . The remainder of the proof is identical to the proof of Theorem ll4.1[ □ 



15. Appendix: One Possible Approach to the Maximum Spacing 

Problem 

In this paper, we considered the following question: given an interval I and 
an ensemble of random matrices, what is the minimum spacing between two 
consecutive eigenvalues in the interval 7? 

It is natural to ask the same question about maximum spacing: what is the 
maximum spacing between two consecutive eigenvalues in the interval 7? 

We suspect that maximum spacing is a more difficult question than mini- 
mum spacing; at least we have not been able to deal with it yet. We begin 
with heuristic predictions based on the tail probabilities of the GUE consec- 
utive spacing distribution. As far as we can tell, even these tail probabilities 
have not been proven rigorously. 

In a region of unit mean spacing, in an appropriate limit, the probability 
that an interval 7 is free of eigenvalues is the Fredholm determinant of the 
integral operator 




(Id - Axi/j) (x) = i/)(x) - J K(x, y)ip(y)dy, 
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where K(x,y) is the sine kernel 

sin(x - y) 

K[x,y) = — r . 

ir(x -y) 

In the above statement and formula, we could replace I by a union J of 
intervals. See }Wid95j Let F(J) be the value of this Fredholm determinant. 

For a single interval, F(I) depends only on the length of the interval s = \I\. 
Let E 2 (0;s) be this probability. Then the consecutive spacing distribution 
is 

d 2 

p 2 (0;s) = —E 2 (0;s). 

See [Meh91], Chapter 5 and Appendix 13. In the minimum spacing prob- 
lem, we used the Taylor expansions of P2{0; s) centered at s = 0: 

„ , s 7rV 7T 4 S 6 ^ 6 S 8 

E ^ 2) = 1 - S + ^-^75- + 17640 + --- 

, , tt 2 s 2 2vr 4 S 4 ^ 6 s 6 

p 2 (0;s) = 1 h... 

F v ; 3 45 315 

For maximum spacing, we require the asymptotics of p2(0;s) for large s. 
Dyson |Dys76| derived the following expansion for large s: 

log(£ 2 (0; a)) = -J~\ Ms) + 3C'(-1) + i log(2) + o(l). 

Unfortunately Dyson's methods did not yield rigorous asymptotics. The 
first rigorous asymptotics are more limited, and were obtained only recently 
bv Widom [Wid94j : 

^log(£ 2 (0; S )) = J + 0(1) 

log(E 2 (0;s)) = ~ + 0(s) 

Since we are only doing heuristics, we choose to use Dyson's expansion. 
Truncating this expansion and differentiating twice we obtain asymptotics 
for p 2 (0; s) for large s: 



P,(P;t)dt 
P2(0;s) 




#+3C'(-l)+ilog(2) 



£+3f'(-l)+ilog(2) 



2 +3C'(-l)+ilog(2) 



As in the minimum spacing problem, we assume that the consecutive spac- 
ings are independent random variables, aware that the assumption is false 
but convenient. Recycling notation from the minimum spacing problem, 
let Gjj jn be the number of consecutive spacings, of a random matrix from 
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an ensemble of n x n matrices, in I n , which are larger than 7. Let Z n be 
the maximum spacing itself. Then, assuming a sum of independent unlikely 
events, G-yj jH will be approximately Poisson with mean 

H = J K n (x, x) (7K " ( 4 X ' io g(2 ) dx 

Then Pr(Z n < 7) = Pr(G 7i /, n = 0) « is the probability that the 
maximum spacing is less than 7. We specialize to the case of constant 
eigenvalue density 1, so that 

u= |/|2i e -^+3C'(-l)+|log(2) 

p 1 1 16 



Our proposed strategy for recovering Pi(Z n < 7) is less direct than in the 
minimum spacing problem. Let Sjj jTl be the set 



[x, x + 7] C I and 
[x, x + 7] contains no eigenvalues. 

x £ I and 
[x, x + 7] contains no eigenvalues. 



Let X 7 = l/S-y / n | be the size of the set Sjj tn . We wish to estimate Pr(Xy = 
0), which we bound using the following inequalities: 

Pr(X 7 _ e < e) < Pr(X 7 = 0) < Pr(Xy < e). 

It may be possible to estimate Pr(Xy < e) and Pr(X 7 „ e < e) using estimates 
for the moments of X 1 and X,,_ e , respectively. 

The (fc)th moment of X 7 is 

F ([x\,x\ + 7] U • • • U [x k , x k + 7]) dxidx 2 ■ ■ ■ dx k . 



1 

J i k 



We expect that the dominant contributions to this integral occur when the 
"clustered together" according to some partition of {1,2,... , k}. For 
example, consider the contribution to E(X^) corresponding to the partition 
(13|2|4). The region ^(i3|2|4) C I 4 corresponding to this partition is 



%3|2|4) C I 4 = < (X 1 ,X 2 ,X 3 ,X4) C I 4 





Xl 


- x 3 


small "| 




Xi 


- x 2 


large > 




Xl 


— X4 


large J 



Since \x\ — X3I is small, [x±, x\ + 7] U [x 3 , x 3 + 7] is a single interval of length 
7+|xi — x 3 \. Thus F([xi,xi+7]U[x 3 ,X3 + 7]) = E 2 (0; 7+ \x\ — x 3 \), which 
is easily calculated and compared to £2(0; 7). 

When intervals l\ and I 2 are well separated, it should be the case that 

U I 2 ) « F(h) ■ F(I 2 ), 
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and similarly for more than two intervals. We call this approximate splitting. 
Since \x\ —X2\ and \x± — x&\ are large, we have three well-separated intervals. 
Thus, 

F([x 1 ,x 1 + 7] U [x 2 ,x 2 +7] U [x 3 ,x 3 + 7] U [x 4 ,x 4 + 7]) 

~ F([xx,X! +7] U [x 3 ,x 3 +7]) • F([x 2 ,x 2 +7]) • ^([^4,^4 + 7]) 
&E 2 (0;i + \ Xl -x 3 \) x£ 2 (0; 7 ) x£ 2 (0; 7 )- 

The integral over fi(i3[2[4) can then be approximated by a product of three 
more easily evaluated integrals. 

We call the readers attention to [Wid95 . It seems that these asymptotics 
are for unions of intervals like [0, n]U [2n, 3n] . That is, the ratios of length and 
separation remain fixed and x — > 00. For approximate splitting, however, 
we require asymptotics where the intervals are of fixed length and they are 
moved far apart. 

Before attempting to show that the remainder of I k makes a negligible con- 
tribution to E(X^), one would first have to quantify "large" and "small" in 
the definition of fi(i3|2[4) and the in the regions corresponding to other par- 
titions. The definition of "small" would depend on how rapidly E 2 (0; 7 + e) 
decays. The definition of "large" would depend on the details of the approx- 
imate splitting conjecture. 

Here is one approach to proving the approximate splitting 

F(hUl 2 ) « F(h) - F{I 2 ). 

Let cei2,i be the eigenvalues of Aj lU j 2 , aij the eigenvalues of A^, and a 2> & 
the eigenvalues of Aj 2 . We must show that 

n(! - «i2,<) ~ n^ 1 - a ^ n^ 1 - <**,*)■ 

i j k 

Let us take for granted that all of the a are positive and less than 1; perhaps 
this is something that could be proven. 

We expect that for the large a±j and a 2t & there correspond a.\2,i- If this 
were true for all a, it would prove approximate separation. Regarding the 
1 — 1 correspondence, there is a subtlety which we illustrate in Figure |U 

It seems plausible that the large aij are well separated from other atijf, 
and that this could be proven. Similarly for the a 2 k- However, there is no 
reason why one of the a±j cannot be close to one of the a 2t k- If we only 
have assurances that one eigenvalue a% 2 % is nearby, then we are missing one. 
We therefore need a multiple eigenvalue version of Weyl's Criterion: 

Weyl's Criterion: Let T be a self adjoint operator on a Hilbert space and 
ip and A such that \Tip — Xip\ < where A is real. Then the intersection 
of the spectrum of T with the interval [A — e, A + e] is nonempty. 
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a_12 
a_l 
a 2 



oo co (s o — e+e- 



-e- 



o o 



-m- 




no control; 
eigenvalues 
too close 



Each small circle on the top line is guaranteed to 
contain one eigenvalue. Each large circle is guaranteed 
to contain two eigenvalues. 



Figure 4. The correspondence between eigenvalues of Aj i: 
Ai 2 , and Ai lU i 2 . When the circles are disjoint, there is no 
danger of confusion and we have a one-to-one correspondence 
of eigenvalues. 



Embarrassingly, the author has been unable to find a reference for Weyl's 
Criterion above, or the following statement. We do not know whether the 
following statement is true as stated or whether or not it is known. 

Statement: For e > sufficiently small, there is a constant C e so that 
the following is true. Suppose that T is a self-adjoint operator on a Hilbert 
space %. Suppose tpi,ip2 £ H and A G 1Z are such that: 

\Tipi - Ai/>i | < e\ipi\ 
\Tip 2 ~ AV>2| < e|V>2| 
IOi,V>2)| < e|V>i| • |^2 1 

Then the interval [A — C e , X + C e ] contains at least two eigenvalues of T. The 
constants C e may be chosen so that C e — > as e — > 0. 

To apply the multiple eigenvalue statement to the current situation, T = 
Ai lL> i 2 , and A be the average value of two nearby eigenvalues a±j and «2,fc- 
Let tpi and ip2 be the corresponding eigenfunctions for Aj 1 and Aj 2 . Then 
should be localized around I\ and ip2 should be localized around 12- Hence ip\ 
and ip2 are nearly orthogonal, and also Aj 2 ipi and Ai x ip2 are small, satisfying 
the hypotheses of the multiple eigenvalue statement. 

Thus for large a, there is a one to one correspondence between eigenvalues 
of either Aj l or Aj 2 and eigenvalues of Aj l{J j 2 . For small a, however, this 
correspondence may no longer hold. Fortunately the small a cannot con- 
tribute much to any of the products Y\(l — a). The reason is that Aj lU j 2 
has trace |/i| + I/2I and also has, we hope, all positive eigenvalues. The large 
eigenvalues 0:12,1 have accounted for all but a small amount of this sum, and 
so the product Y\(l — 0112,1) over the remaining a is close to one. Similarly 
for the other two products. 
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